sevlabr
/

unit-1-PPO-LunarLander-v2

Vladimir Abramov

Upload PPO agent trained in LunarLander-v2 for Unit 1 Deep-RL Course. Epochs: 500k, Mean Reward: 192 +/- 75

64b3873 over 2 years ago

193 Bytes

	OS: Linux-5.4.188+-x86_64-with-Ubuntu-18.04-bionic #1 SMP Sun Apr 24 10:03:06 PDT 2022
	Python: 3.7.13
	Stable-Baselines3: 1.5.0
	PyTorch: 1.11.0+cu113
	GPU Enabled: True
	Numpy: 1.21.6
	Gym: 0.21.0