Andrei Cozma committed
Commit 0c8eecb · 1 Parent(s): 879176c
MonteCarloAgent.py CHANGED
@@ -116,7 +116,6 @@ class MonteCarloAgent:
 
             if e % test_every == 0:
                 test_success_rate = self.test(verbose=False, **kwargs)
-
             if log_wandb:
                 self.wandb_log_img(episode=e)
 
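For context, a hypothetical skeleton of the training loop this hunk sits in. Only `test_every`, `test`, `log_wandb`, and `wandb_log_img` come from the diff; the class name matches the file, and everything else is an assumed stand-in, not the repository's actual code:

```python
class MonteCarloAgent:
    """Hypothetical skeleton around the two if-blocks touched by this hunk."""

    def test(self, verbose=False, **kwargs):
        # Stub: the real method presumably runs greedy rollouts and
        # returns a success rate.
        return 0.0

    def wandb_log_img(self, episode):
        # Stub: the real method presumably logs a policy image to W&B.
        pass

    def train(self, n_episodes=2000, test_every=100, log_wandb=False, **kwargs):
        for e in range(n_episodes):
            # ... one Monte-Carlo rollout and return-averaging update here ...

            # Periodically evaluate the current policy (lines from the diff).
            if e % test_every == 0:
                test_success_rate = self.test(verbose=False, **kwargs)
            # Log a visualization when W&B logging is enabled.
            if log_wandb:
                self.wandb_log_img(episode=e)
```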
README.md CHANGED
@@ -4,9 +4,15 @@
 
 Evolution of Reinforcement Learning methods from pure Dynamic Programming-based methods to Monte Carlo methods + Bellman Optimization Comparison
 
+## Requirements
+
+- Python 3
+- Gymnasium: <https://pypi.org/project/gymnasium/>
+- WandB: <https://pypi.org/project/wandb/> (optional for logging)
+
 ## Monte-Carlo Agent
 
-The implementation of the epsilon-greedy Monte-Carlo agent for the [Cliff Walking](https://gymnasium.farama.org/environments/toy_text/cliff_walking/) toy environment.
+The implementation of the epsilon-greedy Monte-Carlo agent for the [Cliff Walking](https://gymnasium.farama.org/environments/toy_text/cliff_walking/) toy environment as part of Gymnasium.
 
 ### Training
 
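The README names the algorithm without showing it; below is a minimal, self-contained sketch of first-visit epsilon-greedy Monte-Carlo control on CliffWalking-v0 via the Gymnasium API. It is not the repository's MonteCarloAgent; the hyperparameters merely echo the saved policy's filename (2000 episodes, 500 max steps, gamma 0.99, epsilon 0.1):

```python
import numpy as np
import gymnasium as gym

env = gym.make("CliffWalking-v0")
n_states, n_actions = env.observation_space.n, env.action_space.n
Q = np.zeros((n_states, n_actions))       # action-value estimates
counts = np.zeros((n_states, n_actions))  # visit counts for averaging
gamma, epsilon = 0.99, 0.1

for episode in range(2000):
    # Generate one episode with the epsilon-greedy policy w.r.t. Q.
    state, _ = env.reset()
    trajectory = []
    for _ in range(500):  # cap episode length
        if np.random.rand() < epsilon:
            action = env.action_space.sample()
        else:
            action = int(np.argmax(Q[state]))
        next_state, reward, terminated, truncated, _ = env.step(action)
        trajectory.append((state, action, reward))
        state = next_state
        if terminated or truncated:
            break

    # First-visit Monte-Carlo update: average returns from the first
    # occurrence of each (state, action) pair in the episode.
    first_visit = {}
    for t, (s, a, _) in enumerate(trajectory):
        first_visit.setdefault((s, a), t)
    G = 0.0
    for t in reversed(range(len(trajectory))):
        s, a, r = trajectory[t]
        G = gamma * G + r
        if first_visit[(s, a)] == t:
            counts[s, a] += 1
            Q[s, a] += (G - Q[s, a]) / counts[s, a]  # incremental mean
```

The greedy policy `Q.argmax(axis=1)` could then be saved with `np.save`, which would match the shape of the `.npy` artifact updated below.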
policy_mc_CliffWalking-v0_e2000_s500_g0.99_e0.1.npy CHANGED
Binary files a/policy_mc_CliffWalking-v0_e2000_s500_g0.99_e0.1.npy and b/policy_mc_CliffWalking-v0_e2000_s500_g0.99_e0.1.npy differ
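The updated `.npy` artifact can be inspected with NumPy; its exact layout is an assumption here (plausibly one entry per CliffWalking state):

```python
import numpy as np

# Load the saved policy shipped with this commit and check its shape.
policy = np.load("policy_mc_CliffWalking-v0_e2000_s500_g0.99_e0.1.npy")
print(policy.shape, policy.dtype)
```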