Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
minpeter
's Collections
[Dataset] K-Corpus
[Dataset] FineWeb2 Edu Korean
[Model] Very, very small things
[Dataset] Pretrain-corpus
[Model] en-ko trans
[Dataset] Candidate datasets to translate
[Dataset] common-pile korean (Filtered-raw)
[Dataset] PR
[Study] NN MNIST
[Model] FLUX.1 Full Finetuned & Merged
[🛠️] Huggingface Utility
[Dataset] unified standard function calling
[tokenizer] AlternateTokenizer
[Dataset] Function Calling
[Dataset] Pretrain-corpus
updated
28 days ago
Upvote
-
PleIAs/common_corpus
Viewer
•
Updated
Jun 10
•
470M
•
10.4k
•
305
EssentialAI/essential-web-v1.0
Preview
•
Updated
Jun 22
•
50.6k
•
197
HuggingFaceFW/fineweb
Viewer
•
Updated
Jul 11
•
52.5B
•
331k
•
2.31k
HuggingFaceFW/fineweb-edu
Viewer
•
Updated
Jul 11
•
3.5B
•
84k
•
735
HuggingFaceFW/fineweb-2
Viewer
•
Updated
Jun 27
•
5.02B
•
69.3k
•
619
data-is-better-together/fineweb-c
Viewer
•
Updated
Jul 8
•
88.7k
•
371
•
54
allenai/dolmino-mix-1124
Viewer
•
Updated
Dec 17, 2024
•
165M
•
24.1k
•
69
allenai/dolma
Updated
Apr 17, 2024
•
669
•
930
allenai/olmo-mix-1124
Viewer
•
Updated
Jul 15
•
620M
•
25.3k
•
70
mlfoundations/dclm-baseline-1.0
Preview
•
Updated
Jul 22, 2024
•
73.4k
•
231
Zyphra/Zyda-2
Preview
•
Updated
12 days ago
•
33.2k
•
83
Upvote
-
Share collection
View history
Collection guide
Browse collections