alexbalandi
/

ppo-LunarLander-v2-4milsteps-200-envs

Reinforcement Learning

stable-baselines3

deep-reinforcement-learning

Model card Files Files and versions

ppo-LunarLander-v2-4milsteps-200-envs / FinetunedPPO_5mil_steps_total

Ctrl+K

Ctrl+K

1 contributor

History: 1 commit

alexbalandi's picture

Upload PPO LunarLander-v2 trained agent, used 1 mil more steps with more loose variance hyperparameter.

3120398 over 2 years ago

_stable_baselines3_version

5 Bytes

Upload PPO LunarLander-v2 trained agent, used 1 mil more steps with more loose variance hyperparameter. over 2 years ago
data

24.1 kB

Upload PPO LunarLander-v2 trained agent, used 1 mil more steps with more loose variance hyperparameter. over 2 years ago
policy.optimizer.pth

88.1 kB
xet

Upload PPO LunarLander-v2 trained agent, used 1 mil more steps with more loose variance hyperparameter. over 2 years ago
policy.pth

43.4 kB
xet

Upload PPO LunarLander-v2 trained agent, used 1 mil more steps with more loose variance hyperparameter. over 2 years ago
pytorch_variables.pth
Pickle imports
- No problematic imports detected
What is a pickle import?
431 Bytes
xet

Upload PPO LunarLander-v2 trained agent, used 1 mil more steps with more loose variance hyperparameter. over 2 years ago
system_info.txt

226 Bytes

Upload PPO LunarLander-v2 trained agent, used 1 mil more steps with more loose variance hyperparameter. over 2 years ago