unit-1-PPO-LunarLander-v2 / mlp_ppo_lunarlander /_stable_baselines3_version
Vladimir Abramov
Upload PPO agent trained in LunarLander-v2 for Unit 1 Deep-RL Course. Epochs: 500k, Mean Reward: 192 +/- 75
64b3873
1.5.0