ppo-LunarLander-v2 / results.json
zhiweiyoung's picture
larger batch size
0062905
raw
history blame
164 Bytes
{"mean_reward": 240.42937120000002, "std_reward": 20.03850893843122, "is_deterministic": true, "n_eval_episodes": 10, "eval_datetime": "2024-01-06T16:35:47.301453"}