The collection for the Project "Simple Reinforcement Learning for Reasoning"
HKUST NLP Group
university
AI & ML interests
None defined yet.
Recent Activity
View all activity
Collections
6
models
35
hkust-nlp/preselect-fasttext-classifier
Text Classification
•
Updated
•
72
•
4
hkust-nlp/Qwen-2.5-Math-7B-SimpleRL-Zero
Updated
•
311
•
2
hkust-nlp/Qwen-2.5-Math-7B-SimpleRL
Updated
•
387
•
3
hkust-nlp/qwen2.5-7b-coder_codeio_stage1
Updated
•
44
hkust-nlp/qwen2.5-7b-coder_codeio
Updated
•
40
hkust-nlp/qwen2.5-7b-coder_codeio_pp_stage1
Updated
•
55
hkust-nlp/qwen2.5-7b-coder_codeio_pp
Updated
•
88
•
5
hkust-nlp/llama3.1-8b_codeio_stage1
Updated
•
36
hkust-nlp/llama3.1-8b_codeio
Updated
•
35
hkust-nlp/llama3.1-8b_codeio_pp_stage1
Updated
•
40
datasets
21
hkust-nlp/PreSelect-100B
Viewer
•
Updated
•
54.5M
•
1.18k
•
8
hkust-nlp/CodeIO-PyEdu-Reasoning
Preview
•
Updated
•
818
•
45
hkust-nlp/CodeIO-PyEdu-Reasoning-Raw
Updated
•
142
hkust-nlp/SynCSE-partial-NLI
Viewer
•
Updated
•
263k
•
100
•
2
hkust-nlp/SynCSE-scratch-NLI
Viewer
•
Updated
•
276k
•
137
•
2
hkust-nlp/gsm8k-fix
Viewer
•
Updated
•
7.47k
•
117
•
2
hkust-nlp/dart-math-uniform
Viewer
•
Updated
•
591k
•
134
•
9
hkust-nlp/vrt-baseline
Viewer
•
Updated
•
591k
•
70
•
1
hkust-nlp/dart-math-hard
Viewer
•
Updated
•
585k
•
139
•
13
hkust-nlp/dart-math-pool-gsm8k-query-info
Viewer
•
Updated
•
7.47k
•
78
•
2