Commit · 4d5391a · 1 Parent(s): 8f8a2a0
style(nyz): add naive model zoo table
README.md CHANGED
@@ -20,3 +20,30 @@ As an important part of OpenXLab from Shanghai AI Laboratory, OpenDILab features
 OpenDILab contributes to the integration of the latest and most comprehensive achievements in academia as well as the standardization of complex problems in the industry. Our future vision is to promote the development of AI **from perceptual intelligence to decision intelligence,** taking AI technology to a higher level of the general intelligence era.
 
 If you want to contact us & join us, you can ✉️ to our team : <[email protected]>.
+
+
+# Overview of Model Zoo
+
+## Deep Reinforcement Learning
+
+| Algo.\Env. | LunarLander | BipedalWalker | Pendulum | Atari (Pong) | Atari (SpaceInvaders) | Atari (Qbert) | MuJoCo (Hopper) | MuJoCo (Halfcheetah) | MuJoCo (Walker2d) |
+| ------------- | ------------- | ------------------------ | ------------ | -------------- | ------------ | ------------------ | --------- | --------- | --------- |
+| [PPO](https://arxiv.org/abs/1707.06347) | [Model](https://huggingface.co/OpenDILabCommunity/LunarLander-v2-ppo) | | | | | | | | |
+
+## Multi-Agent Reinforcement Learning
+<details close>
+<summary>(Click for Details)</summary>
+TBD
+</details>
+
+## Offline Reinforcement Learning
+<details close>
+<summary>(Click for Details)</summary>
+TBD
+</details>
+
+## Model-Based Reinforcement Learning
+<details close>
+<summary>(Click for Details)</summary>
+TBD
+</details>