arxiv:2501.16496
Neel Nanda
NeelNanda
AI & ML interests
Mechanistic Interpretability
Recent Activity
authored
a paper
6 days ago
Open Problems in Mechanistic Interpretability
authored
a paper
2 months ago
Do I Know This Entity? Knowledge Awareness and Hallucinations in
Language Models
updated
a model
3 months ago
NeelNanda/crosscoders-gpt2-small
Organizations
Papers
11
models
65
NeelNanda/crosscoders-gpt2-small
Updated
•
5
NeelNanda/GELU_1L512W_C4_Code
Updated
•
1.77k
•
2
NeelNanda/gpt-neox-tokenizer-digits
Updated
•
2
NeelNanda/sparse_autoencoder
Updated
•
3
NeelNanda/redwood-attn-only-2l
Updated
•
5
NeelNanda/Othello-GPT-Transformer-Lens
Updated
NeelNanda/full_pred_log_probs
Updated
NeelNanda/SoLU_1L256W_C4_Width_Scan
Updated
•
4
NeelNanda/SoLU_1L128W_C4_Width_Scan
Updated
•
3
NeelNanda/SoLU_1L64W_C4_Width_Scan
Updated
•
3
datasets
15
NeelNanda/pile-small-tokenized-2b
Viewer
•
Updated
•
10.8M
•
1.22k
NeelNanda/pile-tokenized-10b
Viewer
•
Updated
•
10.8M
•
199
•
1
NeelNanda/openwebtext-tokenized-9b
Viewer
•
Updated
•
8.83M
•
286
NeelNanda/code-10k
Viewer
•
Updated
•
10k
•
60
•
1
NeelNanda/wiki-10k
Viewer
•
Updated
•
10k
•
53
NeelNanda/c4-code-20k
Viewer
•
Updated
•
20k
•
99
•
4
NeelNanda/c4-10k
Viewer
•
Updated
•
10k
•
427
NeelNanda/c4-tokenized-2b
Viewer
•
Updated
•
1.36M
•
273
NeelNanda/code-tokenized
Viewer
•
Updated
•
297k
•
56
NeelNanda/c4-code-tokenized-2b
Viewer
•
Updated
•
1.66M
•
75
•
1