File size: 1,785 Bytes
a986605
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
2b7b26d
a986605
2b7b26d
a986605
6d6ae42
 
 
a613750
c7ba506
a613750
 
 
6d6ae42
a613750
a986605
a613750
a986605
2b7b26d
 
 
 
a986605
 
a613750
a986605
 
91e6ac1
 
 
 
 
 
a986605
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
---
language: en
tags:
- machine-learning
- reinforcement-learning
- sokoban
- planning
license: apache-2.0
---

# Trained learned planners

This repository contains the trained networks from the paper ["Planning behavior in a recurrent neural network that
plays Sokoban"](https://openreview.net/forum?id=T9sB3S2hok), presented at the ICML 2024 Mechanistic Interpretability
Workshop.

To load and use the NNs, please refer to the [learned-planner
repository](http://github.com/alignmentresearch/learned-planner), and possibly to the [training code
](https://github.com/AlignmentResearch/train-learned-planner).

# Model details

## Hyperparameters:

See `model/*/cp_*/cfg.json` for the hyperparameters that were used to train a particular run.

## Best Models:

The best models for each of the model type are stored in the following directory:
| Model | Directory | Parameter Count |
|:-------|:-----------|:-----------------|
| DRC(3, 3)  | `drc33/bkynosqi/cp_2002944000` | 1,285,125 (1.29M) |
| DRC(1, 1)  | `drc11/eue6pax7/cp_2002944000` | 987,525 (0.99M)  |
| ResNet     | `resnet/syb50iz7/cp_2002944000` | 3,068,421 (3.07M) |

## Probes & SAEs:

The trained probes and SAEs are stored in the `probes` and `saes` directories, respectively.

## Training dataset:

The [Boxoban set of levels by DeepMind](https://github.com/google-deepmind/boxoban-levels).

# Citation

If you use any of these artifacts, please cite our work:

```bibtex
@inproceedings{garriga-alonso2024planning,
    title={Planning behavior in a recurrent neural network that plays Sokoban},
    author={Adri{\`a} Garriga-Alonso and Mohammad Taufeeque and Adam Gleave},
    booktitle={ICML 2024 Workshop on Mechanistic Interpretability},
    year={2024},
    url={https://openreview.net/forum?id=T9sB3S2hok}
}
```