unit-1-PPO-LunarLander-v2 / mlp_ppo_lunarlander
Vladimir Abramov
Upload PPO agent trained on LunarLander-v2 for Unit 1 of the Deep RL Course. Epochs: 500k, Mean Reward: 192 +/- 75
64b3873