File size: 1,785 Bytes
a986605 2b7b26d a986605 2b7b26d a986605 6d6ae42 a613750 c7ba506 a613750 6d6ae42 a613750 a986605 a613750 a986605 2b7b26d a986605 a613750 a986605 91e6ac1 a986605 |
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 |
---
language: en
tags:
- machine-learning
- reinforcement-learning
- sokoban
- planning
license: apache-2.0
---
# Trained learned planners
This repository contains the trained networks from the paper ["Planning behavior in a recurrent neural network that
plays Sokoban"](https://openreview.net/forum?id=T9sB3S2hok), presented at the ICML 2024 Mechanistic Interpretability
Workshop.
To load and use the NNs, please refer to the [learned-planner
repository](http://github.com/alignmentresearch/learned-planner), and possibly to the [training code
](https://github.com/AlignmentResearch/train-learned-planner).
# Model details
## Hyperparameters:
See `model/*/cp_*/cfg.json` for the hyperparameters that were used to train a particular run.
## Best Models:
The best models for each of the model type are stored in the following directory:
| Model | Directory | Parameter Count |
|:-------|:-----------|:-----------------|
| DRC(3, 3) | `drc33/bkynosqi/cp_2002944000` | 1,285,125 (1.29M) |
| DRC(1, 1) | `drc11/eue6pax7/cp_2002944000` | 987,525 (0.99M) |
| ResNet | `resnet/syb50iz7/cp_2002944000` | 3,068,421 (3.07M) |
## Probes & SAEs:
The trained probes and SAEs are stored in the `probes` and `saes` directories, respectively.
## Training dataset:
The [Boxoban set of levels by DeepMind](https://github.com/google-deepmind/boxoban-levels).
# Citation
If you use any of these artifacts, please cite our work:
```bibtex
@inproceedings{garriga-alonso2024planning,
title={Planning behavior in a recurrent neural network that plays Sokoban},
author={Adri{\`a} Garriga-Alonso and Mohammad Taufeeque and Adam Gleave},
booktitle={ICML 2024 Workshop on Mechanistic Interpretability},
year={2024},
url={https://openreview.net/forum?id=T9sB3S2hok}
}
```
|