-
CohenQu/Qwen3-4B-Base_HintGen-withSol.00.00
Text Generation • 4B • Updated • 1 -
CohenQu/Qwen3-4B-Base_HintGen-withSol.00.01
Text Generation • 4B • Updated • 1 -
CohenQu/Qwen3-4B-Base_HintGen-withSol.01.00
Text Generation • 4B • Updated • 7 -
CohenQu/Qwen3-4B-Base_HintGen-withSol.01.01
Text Generation • 4B • Updated • 3
Yuxiao Qu PRO
CohenQu
AI & ML interests
None yet
Recent Activity
updated
a dataset
about 20 hours ago
CohenQu/Continue_vs_Terminate.05.00
updated
a dataset
about 21 hours ago
CohenQu/Continue_vs_Terminate.05.01
published
a dataset
about 21 hours ago
CohenQu/Continue_vs_Terminate.05.01
Organizations
Flexible Ordering
-
CohenQu/llama3_3b-finemath-4plus-flexible-ordering.00.02
3B • Updated • 3 -
CohenQu/llama3_3b-finemath-4plus-flexible-ordering.00.03
3B • Updated • 3 -
CohenQu/llama3_3b-finemath-4plus-flexible-ordering.00.04
3B • Updated • 3 -
CohenQu/sft_completion_only_finemath-4plus-flexible-ordering.00.00-Step2000_numina-cot-100k_babel
Text Generation • 4B • Updated • 3
RLAD
-
CohenQu/Qwen3-4B-Base_HintGen-withSol.00.00
Text Generation • 4B • Updated • 1 -
CohenQu/Qwen3-4B-Base_HintGen-withSol.00.01
Text Generation • 4B • Updated • 1 -
CohenQu/Qwen3-4B-Base_HintGen-withSol.01.00
Text Generation • 4B • Updated • 7 -
CohenQu/Qwen3-4B-Base_HintGen-withSol.01.01
Text Generation • 4B • Updated • 3
Flexible Ordering
-
CohenQu/llama3_3b-finemath-4plus-flexible-ordering.00.02
3B • Updated • 3 -
CohenQu/llama3_3b-finemath-4plus-flexible-ordering.00.03
3B • Updated • 3 -
CohenQu/llama3_3b-finemath-4plus-flexible-ordering.00.04
3B • Updated • 3 -
CohenQu/sft_completion_only_finemath-4plus-flexible-ordering.00.00-Step2000_numina-cot-100k_babel
Text Generation • 4B • Updated • 3
models
357

CohenQu/sft_llama3_3b-finemath-4plus-flexible-ordering.02.04_long-35000_numina-cot-100k_orchard
Text Generation
•
4B
•
Updated
•
13

CohenQu/sft_llama3_3b-finemath-4plus-flexible-ordering.02.04_long-30000_numina-cot-100k_orchard
Text Generation
•
4B
•
Updated
•
7

CohenQu/sft_llama3_3b-finemath-4plus-flexible-ordering.02.04_long-25000_numina-cot-100k_orchard
Text Generation
•
4B
•
Updated
•
7

CohenQu/sft_llama3_3b-finemath-4plus-flexible-ordering.02.04_long-20000_numina-cot-100k_orchard
Text Generation
•
4B
•
Updated
•
7

CohenQu/sft_llama3_3b-finemath-4plus-flexible-ordering.02.04_long-10000_numina-cot-100k_orchard
Text Generation
•
4B
•
Updated
•
7

CohenQu/sft_llama3_3b-finemath-4plus-flexible-ordering.02.04_long-5000_numina-cot-100k_orchard
Text Generation
•
4B
•
Updated
•
7

CohenQu/sft_llama3_3b-finemath-4plus-flexible-ordering.00.00_long-15000_numina-cot-100k_orchard
Text Generation
•
4B
•
Updated
•
11

CohenQu/sft_llama3_3b-finemath-4plus-flexible-ordering.02.04_long-15000_numina-cot-100k_orchard
Text Generation
•
4B
•
Updated
•
7

CohenQu/sft_llama3_3b-finemath-4plus-flexible-ordering.00.00_long-25000_numina-cot-100k_orchard
Text Generation
•
4B
•
Updated
•
15

CohenQu/sft_llama3_3b-finemath-4plus-flexible-ordering.02.01_long-30000_numina-cot-100k_orchard
Text Generation
•
4B
•
Updated
•
7
datasets
189
CohenQu/Continue_vs_Terminate.05.00
Viewer
•
Updated
•
6.96k
CohenQu/Continue_vs_Terminate.05.01
Viewer
•
Updated
•
6.96k
CohenQu/continue_vs_terminate_Qwen3-1.7B_DAPO-Math-en_0-2000_8000-10000_counterfactual_reason
Viewer
•
Updated
•
6.96k
•
121
CohenQu/arxiv_rlad_math_reasoning_benchmark_hints_gen_iter1_sol_prompt
Viewer
•
Updated
•
270
•
39
CohenQu/arxiv_rlad_math_reasoning_benchmark_hints_iter1_prompt
Viewer
•
Updated
•
34
•
22
CohenQu/arxiv_rlad_math_reasoning_benchmark_hints
Viewer
•
Updated
•
600
•
72
CohenQu/Joint_train_AceReason_AIME_HMMT_filter_RL
Viewer
•
Updated
•
3.9k
•
146
CohenQu/Joint_train_stage3_filter_RL
Viewer
•
Updated
•
4.14k
•
107
CohenQu/Joint_train_AIME_HMMT_RL
Viewer
•
Updated
•
190
•
113
CohenQu/finemath-4plus-flexible-ordering.02.02
Viewer
•
Updated
•
13.4M
•
322