shtif / LLv2-PPO

Reinforcement Learning

TensorBoard

LunarLander-v2

ppo

deep-reinforcement-learning

custom-implementation

deep-rl-course

Eval Results

LLv2-PPO / results.json

Push agent to the Hub (shtif, commit af0fe8c, over 1 year ago)

173 Bytes

{"env_id": "LunarLander-v2", "mean_reward": 1.0832669532358807, "std_reward": 59.408924317500464, "n_evaluation_episodes": 10, "eval_datetime": "2023-08-07T07:25:59.832631"}