Spaces:
Running
Running
Update README.md
Browse files
README.md
CHANGED
@@ -50,12 +50,12 @@ If you want to contact us & join us, you can βοΈ to our team : <opendilab@p
|
|
50 |
| Algo.\Env. | [CartPole](https://di-engine-docs.readthedocs.io/en/latest/13_envs/cartpole.html) | [LunarLander](https://di-engine-docs.readthedocs.io/en/latest/13_envs/lunarlander.html) | [LunarLanderContinuous](https://di-engine-docs.readthedocs.io/en/latest/13_envs/lunarlander.html) | [Pendulum](https://di-engine-docs.readthedocs.io/en/latest/13_envs/pendulum.html) | [Pong](https://di-engine-docs.readthedocs.io/en/latest/13_envs/atari.html) | [Breakout](https://di-engine-docs.readthedocs.io/en/latest/13_envs/atari.html) | [MsPacman](https://di-engine-docs.readthedocs.io/en/latest/13_envs/atari.html) | []() | []() | []() |
|
51 |
| :-------------: | :-------------: | :-------------: | :------------------------: | :------------: | :--------------: | :------------: | :------------------: | :---------: | :---------: | :---------: |
|
52 |
| [AlphaZero](https://www.science.org/doi/10.1126/science.aar6404) | | | | | | | | | | |
|
53 |
-
| [Sampled AlphaZero]() | | | | | | | | | | |
|
54 |
| [Muzero](https://arxiv.org/abs/1911.08265) | [β
](https://huggingface.co/OpenDILabCommunity/CartPole-v0-MuZero) | | | | [β
](https://huggingface.co/OpenDILabCommunity/PongNoFrameskip-v4-MuZero) | | [β
](https://huggingface.co/OpenDILabCommunity/MsPacmanNoFrameskip-v4-MuZero) | | | |
|
55 |
| [EfficientZero](https://arxiv.org/abs/2111.00210) | | | | | | | | | | |
|
56 |
| [Gumbel MuZero](https://openreview.net/pdf?id=bERaNdoegnO&) | | | | | | | | | | |
|
57 |
-
| [Sampled EfficientZero]() | | | | | | | | | | |
|
58 |
-
| [Stochastic MuZero]() | | | | | | | | | | |
|
59 |
|
60 |
</details>
|
61 |
|
|
|
50 |
| Algo.\Env. | [CartPole](https://di-engine-docs.readthedocs.io/en/latest/13_envs/cartpole.html) | [LunarLander](https://di-engine-docs.readthedocs.io/en/latest/13_envs/lunarlander.html) | [LunarLanderContinuous](https://di-engine-docs.readthedocs.io/en/latest/13_envs/lunarlander.html) | [Pendulum](https://di-engine-docs.readthedocs.io/en/latest/13_envs/pendulum.html) | [Pong](https://di-engine-docs.readthedocs.io/en/latest/13_envs/atari.html) | [Breakout](https://di-engine-docs.readthedocs.io/en/latest/13_envs/atari.html) | [MsPacman](https://di-engine-docs.readthedocs.io/en/latest/13_envs/atari.html) | []() | []() | []() |
|
51 |
| :-------------: | :-------------: | :-------------: | :------------------------: | :------------: | :--------------: | :------------: | :------------------: | :---------: | :---------: | :---------: |
|
52 |
| [AlphaZero](https://www.science.org/doi/10.1126/science.aar6404) | | | | | | | | | | |
|
53 |
+
| [Sampled AlphaZero](https://www.science.org/doi/10.1126/science.aar6404) | | | | | | | | | | |
|
54 |
| [Muzero](https://arxiv.org/abs/1911.08265) | [β
](https://huggingface.co/OpenDILabCommunity/CartPole-v0-MuZero) | | | | [β
](https://huggingface.co/OpenDILabCommunity/PongNoFrameskip-v4-MuZero) | | [β
](https://huggingface.co/OpenDILabCommunity/MsPacmanNoFrameskip-v4-MuZero) | | | |
|
55 |
| [EfficientZero](https://arxiv.org/abs/2111.00210) | | | | | | | | | | |
|
56 |
| [Gumbel MuZero](https://openreview.net/pdf?id=bERaNdoegnO&) | | | | | | | | | | |
|
57 |
+
| [Sampled EfficientZero](https://arxiv.org/abs/2104.06303) | | | | | | | | | | |
|
58 |
+
| [Stochastic MuZero](https://openreview.net/pdf?id=X6D9bAHhBQ1) | | | | | | | | | | |
|
59 |
|
60 |
</details>
|
61 |
|