PPO – ppo-LunarLander-v3

Trained with Stable-Baselines3 (mini-run ~80k steps).
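A short run like this could look roughly like the sketch below. It is not the script used for this model: the function name `train_ppo`, the `MlpPolicy` choice, the default hyperparameters, and the save path are assumptions, and the environment ID is left as a parameter. It requires `stable-baselines3` and `gymnasium` to be installed.

```python
def train_ppo(env_id, total_timesteps=80_000, save_path="ppo_model"):
    """Train a PPO agent with Stable-Baselines3 and save it to disk.

    Hypothetical helper: policy type, default hyperparameters, and
    save_path are illustrative assumptions, not the original settings.
    """
    # Imported lazily so the module loads even without the RL packages.
    import gymnasium as gym
    from stable_baselines3 import PPO

    env = gym.make(env_id)
    model = PPO("MlpPolicy", env, verbose=0)
    model.learn(total_timesteps=total_timesteps)
    model.save(save_path)
    env.close()
    return model
```

Calling `train_ppo(env_id)` with the desired Gymnasium environment ID runs the training; ~80k steps is short, so results can vary noticeably between seeds.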

Evaluation (10 episodes): 500.00 ± 0.00
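Assuming the figure above is the mean and standard deviation of per-episode return (the pair that Stable-Baselines3's `evaluate_policy` reports), the summary can be reproduced from a list of episode returns as in this minimal sketch. The helper names and the sample returns are illustrative, not taken from the original run.

```python
from statistics import fmean, pstdev


def summarize_returns(episode_returns):
    """Return (mean, std) of episode returns.

    Uses the population standard deviation, which matches
    NumPy's default behaviour for np.std.
    """
    return fmean(episode_returns), pstdev(episode_returns)


def format_summary(episode_returns):
    """Format returns as 'mean ± std' with two decimals."""
    mean, std = summarize_returns(episode_returns)
    return f"{mean:.2f} ± {std:.2f}"


# Hypothetical data: ten episodes that each reached a return of 500.
returns = [500.0] * 10
print(format_summary(returns))  # prints "500.00 ± 0.00"
```

A standard deviation of exactly 0.00 means every evaluation episode ended with the same return.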

Run: Cartpole-v1__sb3__1759339843

Note: Uploaded without a replay video to avoid Colab rendering issues.
