Single sparse autoencoders (SAEs) trained on the residual stream activation vectors from every transformer layer simultaneously: https://arxiv.org/abs/2409.04185
Tim Lawson
tim-lawson
AI & ML interests
Mechanistic interpretability, language modelling, semantics
Recent Activity
updated a model 3 days ago: tim-lawson/sae-pythia-160m-deduped-x64-k32-control
updated a model 3 days ago: tim-lawson/sae-pythia-160m-deduped-x64-k32-embed
updated a model 3 days ago: tim-lawson/sae-pythia-160m-deduped-x64-k32-randomized
Organizations: None yet
Collections: 6
Papers: 1
Models: 206
tim-lawson/sae-pythia-160m-deduped-x64-k32-control • Updated • 4
tim-lawson/sae-pythia-160m-deduped-x64-k32-embed • Updated • 4
tim-lawson/sae-pythia-160m-deduped-x64-k32-randomized • Updated • 4
tim-lawson/sae-pythia-160m-deduped-x64-k32-step0 • Updated • 4
tim-lawson/sae-pythia-160m-deduped-x64-k32-trained • Updated • 4
tim-lawson/mlsae-gemma-2-2b-x64-k32 • Updated • 8
tim-lawson/mlsae-gemma-2-2b-x64-k32-tfm • Updated • 16
tim-lawson/mlsae-pythia-2.8b-deduped-x64-k32 • Updated • 5
tim-lawson/mlsae-pythia-2.8b-deduped-x64-k32-tfm • Updated • 4
tim-lawson/mlsae-pythia-410m-deduped-x64-k32-tfm • Updated • 41
Datasets: 60
tim-lawson/mlsae-Llama-3.2-3B-x64-k32-dists • Viewer • Updated • 197k • 52
tim-lawson/mlsae-gemma-2-2b-x64-k32-dists • Viewer • Updated • 147k • 53
tim-lawson/mlsae-gpt2-x64-k32-dists • Preview • Updated • 32
tim-lawson/sae-pythia-160m-deduped-x64-k32-tfm-layers-11-dists • Viewer • Updated • 49.2k • 32
tim-lawson/sae-pythia-160m-deduped-x64-k32-tfm-layers-10-dists • Viewer • Updated • 49.2k • 31
tim-lawson/sae-pythia-160m-deduped-x64-k32-tfm-layers-8-dists • Viewer • Updated • 49.2k • 32
tim-lawson/sae-pythia-160m-deduped-x64-k32-tfm-layers-9-dists • Viewer • Updated • 49.2k • 32
tim-lawson/sae-pythia-160m-deduped-x64-k32-tfm-layers-7-dists • Viewer • Updated • 49.2k • 33
tim-lawson/sae-pythia-160m-deduped-x64-k32-tfm-layers-5-dists • Viewer • Updated • 49.2k • 33
tim-lawson/sae-pythia-160m-deduped-x64-k32-tfm-layers-6-dists • Viewer • Updated • 49.2k • 32