pietrolesci's Collections
UnimixLM
Interesting Pre-Training Datasets
The Pile Companion
Generalisation-Profiles
Machine Translation Datasets
Text Classification Datasets
Dialogue State Tracking Datasets
NLI Eval Datasets
AnchorAL
Memorisation-Profiles
Tokenisation-Bias
Generalisation-Profiles
Updated Mar 17
pietrolesci/pile-deduped-pythia-tokfreq • Viewer • Updated Mar 17 • 50.1k • 5
pietrolesci/pile-deduped-pythia-preshuffled • Viewer • Updated Mar 25 • 244M • 586
pietrolesci/pile-validation • Viewer • Updated Apr 9 • 429k • 192