ppo-LunarLander-v2 / results.json
First PPO LunarLander-v2 trained agent
bad4f7e
163 Bytes
{"mean_reward": 263.93145202448227, "std_reward": 17.7334081950883, "is_deterministic": true, "n_eval_episodes": 10, "eval_datetime": "2022-05-25T20:25:33.403029"}