Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
EssentialAI
's Collections
Essential-Web v1.0
Rethinking Reflection in Pre-Training
Essential-Web v1.0
updated
Jun 18
Upvote
8
Essential-Web v1.0: 24T tokens of organized web data
Paper
•
2506.14111
•
Published
Jun 17
•
43
EssentialAI/essential-web-v1.0
Preview
•
Updated
Jun 22
•
51.7k
•
197
EssentialAI/eai-distill-0.5b
0.6B
•
Updated
Jun 18
•
1.18k
•
22
EssentialAI/eai-taxonomy-math-w-fm
Viewer
•
Updated
Jun 22
•
21.6M
•
3.31k
•
6
EssentialAI/eai-taxonomy-code-w-dclm
Viewer
•
Updated
Jun 22
•
274M
•
5.74k
•
7
EssentialAI/eai-taxonomy-code-w-dclm-100b-sample
Viewer
•
Updated
Jun 22
•
46.2M
•
522
•
2
EssentialAI/eai-taxonomy-med-w-dclm
Viewer
•
Updated
Jun 22
•
81.2M
•
3.41k
•
8
EssentialAI/eai-taxonomy-med-w-dclm-100b-sample
Viewer
•
Updated
Jun 22
•
36.6M
•
1.82k
•
2
EssentialAI/eai-taxonomy-stem-w-dclm
Preview
•
Updated
Jun 22
•
3.54k
•
5
EssentialAI/eai-taxonomy-stem-w-dclm-100b-sample
Viewer
•
Updated
Jun 22
•
35.5M
•
2.47k
•
4
Upvote
8
+4
Share collection
View history
Collection guide
Browse collections