PPO Agent playing Walker2DBulletEnv-v0
This is a trained model of a PPO agent playing Walker2DBulletEnv-v0 using the stable-baselines3 library.
Usage (with Stable-baselines3)
- Downloads last month
- 0
Evaluation results
- mean_reward on Walker2DBulletEnv-v0self-reported1968.90 +/- 16.24