Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
shtif
/
LLv2-PPO
like
0
Reinforcement Learning
TensorBoard
LunarLander-v2
ppo
deep-reinforcement-learning
custom-implementation
deep-rl-course
Eval Results
Model card
Files
Files and versions
Metrics
Training metrics
Community
main
LLv2-PPO
/
results.json
shtif
Push agent to the Hub
af0fe8c
over 1 year ago
raw
Copy download link
history
blame
contribute
delete
Safe
173 Bytes
{
"env_id"
:
"LunarLander-v2"
,
"mean_reward"
:
1.0832669532358807
,
"std_reward"
:
59.408924317500464
,
"n_evaluation_episodes"
:
10
,
"eval_datetime"
:
"2023-08-07T07:25:59.832631"
}