Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

alexbalandi
/
ppo-LunarLander-v2-4milsteps-200-envs

Reinforcement Learning
stable-baselines3
LunarLander-v2
deep-reinforcement-learning
Eval Results
Model card Files Files and versions
xet
Community
ppo-LunarLander-v2-4milsteps-200-envs / FinetunedPPO_5mil_steps_total
Ctrl+K
Ctrl+K
  • 1 contributor
History: 1 commit
alexbalandi's picture
alexbalandi
Upload PPO LunarLander-v2 trained agent, used 1 mil more steps with more loose variance hyperparameter.
3120398 over 2 years ago
  • _stable_baselines3_version
    5 Bytes
    Upload PPO LunarLander-v2 trained agent, used 1 mil more steps with more loose variance hyperparameter. over 2 years ago
  • data
    24.1 kB
    Upload PPO LunarLander-v2 trained agent, used 1 mil more steps with more loose variance hyperparameter. over 2 years ago
  • policy.optimizer.pth
    88.1 kB
    xet
    Upload PPO LunarLander-v2 trained agent, used 1 mil more steps with more loose variance hyperparameter. over 2 years ago
  • policy.pth
    43.4 kB
    xet
    Upload PPO LunarLander-v2 trained agent, used 1 mil more steps with more loose variance hyperparameter. over 2 years ago
  • pytorch_variables.pth

    Pickle imports

    • No problematic imports detected

    What is a pickle import?

    431 Bytes
    xet
    Upload PPO LunarLander-v2 trained agent, used 1 mil more steps with more loose variance hyperparameter. over 2 years ago
  • system_info.txt
    226 Bytes
    Upload PPO LunarLander-v2 trained agent, used 1 mil more steps with more loose variance hyperparameter. over 2 years ago