thatgeeman
/

PPO-lunarlanderv2-hfRLU1

Reinforcement Learning

stable-baselines3

deep-reinforcement-learning

Model card Files Files and versions Community

thatgeeman commited on Mar 8, 2023

Commit

03ec795

•

1 Parent(s): 6ea878a

Update README.md

Files changed (1) hide show

README.md +20 -0

README.md CHANGED Viewed

@@ -25,6 +25,26 @@ model-index:
 This is a trained model of a **PPO** agent playing **LunarLander-v2**
 using the [stable-baselines3 library](https://github.com/DLR-RM/stable-baselines3).
 ## Usage (with Stable-baselines3)
 TODO: Add your code

 This is a trained model of a **PPO** agent playing **LunarLander-v2**
 using the [stable-baselines3 library](https://github.com/DLR-RM/stable-baselines3).
+## Model parameters
+```python
+model = PPO(
+    policy = 'MlpPolicy',
+    env = env,
+    n_steps = 1024,
+    batch_size = 64,
+    n_epochs = 10,
+    gamma = 0.999,
+    gae_lambda = 0.98,
+    ent_coef = 0.01,
+    verbose=1)
+```
+Trained for 10^6 steps using
+```python
+steps = 1e6
+model.learn(total_timesteps=int(steps))
+```
 ## Usage (with Stable-baselines3)
 TODO: Add your code