Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
heegyu
's Collections
Korean Reward Modeling
Korean Pretraining Dataset
AjouBlue GPTs
Datasets Translated to Korean
Synthetic Dataset
RLHF papers
Reward Modeling Datasets
Pre-training Dataset
Vision LM
Image Generation
Domain Specific (Math, Code, etc)
Machine Translation
Safety LM
Text2SQL
Korean Pretraining Dataset
updated
Nov 19
Upvote
10
heegyu/namuwiki-extracted
Viewer
•
Updated
Jan 15, 2023
•
565k
•
213
•
13
heegyu/kowikitext
Viewer
•
Updated
Oct 2, 2022
•
1.33M
•
218
•
5
maywell/korean_textbooks
Viewer
•
Updated
Jan 10
•
4.42M
•
875
•
106
heegyu/korean-petitions
Viewer
•
Updated
Jan 15, 2023
•
437k
•
162
•
7
hac541309/basic_korean_dict
Viewer
•
Updated
Jul 26, 2023
•
74.9k
•
77
•
4
lcw99/oscar-ko-only
Viewer
•
Updated
Oct 21, 2022
•
3.68M
•
110
•
3
uonlp/CulturaX
Viewer
•
Updated
8 days ago
•
7.18B
•
15.9k
•
484
Note
mC4 + OSCAR +
lbox/lbox_open
Viewer
•
Updated
Nov 9, 2022
•
301k
•
296
•
12
HAERAE-HUB/KOREAN-WEBTEXT
Viewer
•
Updated
May 31
•
1.28M
•
252
•
28
HAERAE-HUB/KOREAN-SyntheticText-1.5B
Viewer
•
Updated
Jul 22
•
1.55M
•
124
•
13
Upvote
10
+6
Share collection
View history
Collection guide
Browse collections