tim-lawson committed
Commit c847acb · verified · 1 Parent(s): bcd5ce9

Push model using huggingface_hub.

Files changed (1)
  1. README.md +37 -3
README.md CHANGED
@@ -1,9 +1,43 @@
  ---
  tags:
  - model_hub_mixin
  - pytorch_model_hub_mixin
  ---

- This model has been pushed to the Hub using the [PytorchModelHubMixin](https://huggingface.co/docs/huggingface_hub/package_reference/mixins#huggingface_hub.PyTorchModelHubMixin) integration:
- - Library: [More Information Needed]
- - Docs: [More Information Needed]
  ---
+ language: en
+ library_name: mlsae
+ license: mit
  tags:
+ - arxiv:2409.04185
  - model_hub_mixin
  - pytorch_model_hub_mixin
+ expansion_factor: 64
+ base_model: EleutherAI/pythia-70m-deduped
+ k: 32
  ---

+ # Model Card for
+
+ A Multi-Layer Sparse Autoencoder (MLSAE) trained on the residual stream activation
+ vectors from [EleutherAI/pythia-70m-deduped](https://huggingface.co/EleutherAI/pythia-70m-deduped) with an
+ expansion factor of \(R = 64\) and sparsity \(k = 32\),
+ over 1 billion tokens from [monology/pile-uncopyrighted](https://huggingface.co/datasets/monology/pile-uncopyrighted).
+
+ ### Model Sources
+
+ - **Repository:** <https://github.com/tim-lawson/mlsae>
+ - **Paper:** <https://arxiv.org/abs/2409.04185>
+ - **Weights & Biases:** <https://wandb.ai/timlawson-/mlsae>
+
+ ## Citation
+
+ **BibTeX:**
+
+ @misc{lawson_residual_2024,
+   title = {Residual {{Stream Analysis}} with {{Multi-Layer SAEs}}},
+   author = {Lawson, Tim and Farnik, Lucy and Houghton, Conor and Aitchison, Laurence},
+   year = {2024},
+   month = oct,
+   number = {arXiv:2409.04185},
+   eprint = {2409.04185},
+   primaryclass = {cs},
+   publisher = {arXiv},
+   doi = {10.48550/arXiv.2409.04185},
+   urldate = {2024-10-08},
+   archiveprefix = {arXiv}
+ }
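
The `expansion_factor: 64` and `k: 32` fields added to the front matter can be read roughly as follows: the autoencoder has 64 times as many latents as the input has dimensions, and only the 32 largest latent activations are kept for each input vector. The snippet below is a minimal pure-Python sketch of that top-k step under stated assumptions. It is not the `mlsae` library's implementation: the function name `topk_sparse_encode`, the toy input width `d_model = 8`, and the random encoder weights are all hypothetical, and biases, the decoder, and any normalization are omitted.

```python
import random


def topk_sparse_encode(x, w_enc, k):
    """Return latent activations with all but the k largest set to zero."""
    # Dense pre-activations: one dot product per latent (one row of w_enc each).
    pre = [sum(w * xi for w, xi in zip(row, x)) for row in w_enc]
    # Indices of the k largest pre-activations.
    order = sorted(range(len(pre)), key=lambda i: pre[i], reverse=True)
    keep = set(order[:k])
    # Zero every latent outside the top k.
    return [v if i in keep else 0.0 for i, v in enumerate(pre)]


# Values taken from the model card's front matter.
expansion_factor = 64
k = 32

# Hypothetical toy input width, chosen only to keep the example small.
d_model = 8
n_latents = expansion_factor * d_model

# Random stand-ins for trained encoder weights and an activation vector.
rng = random.Random(0)
w_enc = [[rng.gauss(0.0, 1.0) for _ in range(d_model)] for _ in range(n_latents)]
x = [rng.gauss(0.0, 1.0) for _ in range(d_model)]

latents = topk_sparse_encode(x, w_enc, k)
n_active = sum(1 for v in latents if v != 0.0)
print(f"{n_latents} latents, at most {k} active, {n_active} nonzero here")
```

With the real model the same arithmetic applies to the residual stream width of the base model in place of the toy `d_model`, so the number of latents scales with the expansion factor while the number of active latents per token stays bounded by `k`.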