ppo-LunarLander-v2 / results.json
My second test version of PPO LunarLander-v2 trained agent
390c3c3 verified
{"mean_reward": 288.8178156, "std_reward": 20.275310901033837, "is_deterministic": true, "n_eval_episodes": 10, "eval_datetime": "2024-01-25T00:27:11.547572"}