metadata
tags:
- PongNoFrameskip-v4
- ppo
- reinforcement-learning
- stable-baselines3
- deep-rl
- atari
model-index:
- name: PPO Pong
results:
- task:
type: reinforcement-learning
name: Pong
dataset:
name: PongNoFrameskip-v4
type: atari
metrics:
- name: Mean Reward
type: mean_reward
value: 20.40 +/- 0.92
PPO Agent playing PongNoFrameskip-v4
This is a trained model of a PPO agent playing PongNoFrameskip-v4.
To learn to use this model and train yours, check the Deep Reinforcement Learning Course on Hugging Face.