Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
timaeus
's Collections
Datasets: Pile Subsets
Projects: Finetuning
Project: Lang2
Project: Lang1
Project: ICL1
Models: dh
Models: H-dh
Models: H
Models: L
Models: dm
Datasets: Suffixes
Datasets: Prefixes
Datasets: Delimiters
Datasets: Currencies
Datasets: Other
Models: H-dh
updated
Oct 18
Attention-only transformers, sweep over number of heads (variable head dimension)
Upvote
-
timaeus/H1-dh256
Updated
Oct 18
•
2
timaeus/H2-dh128
Updated
Oct 18
•
5
timaeus/H4-dh64
Updated
Oct 18
•
5
timaeus/L2
Updated
Oct 18
•
6
timaeus/H16-dh16
Updated
Oct 17
•
3
timaeus/H32-dh8
Updated
Oct 17
•
3
timaeus/H64-dh4
Updated
Oct 17
•
2
Upvote
-
Share collection
View history
Collection guide
Browse collections