TheoVincent committed on
Commit
caa963d
·
1 Parent(s): 7581d93

prettier model card

Files changed (1)
  1. README.md +4 -4
README.md CHANGED
@@ -10,16 +10,16 @@ co2_eq_emissions:
  ---
 
  # Model parameters trained with `i-DQN` and `i-IQN`
- This repository contains the model parameters trained with `i-DQN` on [56 Atari games](#i-DQN_games) and trained with `i-IQN` on [20 Atari games](#i-IQN_games) 🎮. 5 seeds are available for each configuration which makes a total of **380 available models** 📈.
 
- The [evaluate.ipynb](./evaluate.ipynb) notebook contains a minimal example to evaluate to model parameters 🧑‍🏫. It uses JAX 🚀. The hyperparameters used during training are reported in [config.json](./config.json) 🔧.
 
- Training code 👉[💻](https://github.com/theovincent/i-DQN)
 
  ps: The set of [20 Atari games](#i-DQN_games) is included in the set of [56 Atari games](#i-IQN_games).
 
  ### Model performances
- | <div style="width:300px; font-size: 30px; font-family:Serif; font-name:Times New Roman" > **i-DQN** and **i-IQN** are improvements of [DQN](https://www.nature.com/articles/nature14236.pdf) and [IQN](https://arxiv.org/abs/1806.06923) published at TMLR ✨. <br> Link 👉[📄](https://arxiv.org/abs/2403.02107) </br> <div style="font-size: 16px"> <details> <summary id=i-DQN_games>List of games trained with `i-DQN` </summary> *Alien, Amidar, Assault, Asterix, Asteroids, Atlantis, BankHeist, BattleZone, BeamRider, Berzerk, Bowling, Boxing, Breakout, Centipede, ChopperCommand, CrazyClimber, DemonAttack, DoubleDunk, Enduro, FishingDerby, Freeway, Frostbite, Gopher, Gravitar, Hero, IceHockey, Jamesbond, Kangaroo, Krull, KungFuMaster, MontezumaRevenge, MsPacman, NameThisGame, Phoenix, Pitfall, Pong, Pooyan, PrivateEye, Qbert, Riverraid, RoadRunner, Robotank, Seaquest, Skiing, Solaris, SpaceInvaders, StarGunner, Tennis, TimePilot, Tutankham, UpNDown, Venture, VideoPinball, WizardOfWor, YarsRevenge, Zaxxon.* </details> <details> <summary id=i-IQN_games>List of games trained with `i-IQN`</summary> *Alien, Assault, BankHeist, Berzerk, Breakout, Centipede, ChopperCommand, DemonAttack, Enduro, Frostbite, Gopher, Gravitar, IceHockey, Jamesbond, Krull, KungFuMaster, Riverraid, Seaquest, Skiing, StarGunner.* </details> </div> </div> | <img src="performances.png" alt="drawing" width="600px"/> |
  | :-: | :-: |
 
  ## User installation
 
  ---
 
  # Model parameters trained with `i-DQN` and `i-IQN`
+ This repository contains the model parameters trained with `i-DQN` on [56 Atari games](#i-DQN_games) and trained with `i-IQN` on [20 Atari games](#i-IQN_games) 🎮 5 seeds are available for each configuration which makes a total of **380 available models** 📈
 
+ The [evaluate.ipynb](./evaluate.ipynb) notebook contains a minimal example to evaluate to model parameters 🧑‍🏫 It uses JAX 🚀 The hyperparameters used during training are reported in [config.json](./config.json) 🔧
 
+ To the training code 👉[💻](https://github.com/theovincent/i-DQN)
 
  ps: The set of [20 Atari games](#i-DQN_games) is included in the set of [56 Atari games](#i-IQN_games).
 
  ### Model performances
+ | <div style="width:300px; font-size: 30px; font-family:Serif; font-name:Times New Roman" > **i-DQN** and **i-IQN** are improvements of [DQN](https://www.nature.com/articles/nature14236.pdf) and [IQN](https://arxiv.org/abs/1806.06923). <br> Published at [TMLR](https://arxiv.org/abs/2403.02107)✨ </br> <div style="font-size: 16px"> <details> <summary id=i-DQN_games>List of games trained with `i-DQN` </summary> *Alien, Amidar, Assault, Asterix, Asteroids, Atlantis, BankHeist, BattleZone, BeamRider, Berzerk, Bowling, Boxing, Breakout, Centipede, ChopperCommand, CrazyClimber, DemonAttack, DoubleDunk, Enduro, FishingDerby, Freeway, Frostbite, Gopher, Gravitar, Hero, IceHockey, Jamesbond, Kangaroo, Krull, KungFuMaster, MontezumaRevenge, MsPacman, NameThisGame, Phoenix, Pitfall, Pong, Pooyan, PrivateEye, Qbert, Riverraid, RoadRunner, Robotank, Seaquest, Skiing, Solaris, SpaceInvaders, StarGunner, Tennis, TimePilot, Tutankham, UpNDown, Venture, VideoPinball, WizardOfWor, YarsRevenge, Zaxxon.* </details> <details> <summary id=i-IQN_games>List of games trained with `i-IQN`</summary> *Alien, Assault, BankHeist, Berzerk, Breakout, Centipede, ChopperCommand, DemonAttack, Enduro, Frostbite, Gopher, Gravitar, IceHockey, Jamesbond, Krull, KungFuMaster, Riverraid, Seaquest, Skiing, StarGunner.* </details> </div> </div> | <img src="performances.png" alt="drawing" width="600px"/> |
  | :-: | :-: |
 
  ## User installation