CohenQu/Joint-Train-deepscalar_RL_hard_500_verl_0.35_0.001_0.001_32_32_20k_4_0713 2B • Updated 21 days ago • 111
CohenQu/Joint-Train-deepscalar_RL_hard_500_verl_0.35_0.001_0.001_32_32_20k_4_0710 2B • Updated 24 days ago • 8
CohenQu/sft_llama3_3b-finemath-4plus-flexible-ordering.00.08-6000_numina-cot-100k_orchard Text Generation • 4B • Updated Jul 1 • 4
CohenQu/sft_llama3_3b-finemath-4plus-flexible-ordering.00.08-8000_numina-cot-100k_orchard Text Generation • 4B • Updated Jul 1 • 5
CohenQu/sft_llama3_3b-finemath-4plus-flexible-ordering.00.08-2000_numina-cot-100k_orchard Text Generation • 4B • Updated Jul 1 • 5
CohenQu/sft_llama3_3b-finemath-4plus-flexible-ordering.00.08-4000_numina-cot-100k_orchard Text Generation • 4B • Updated Jul 1 • 4
CohenQu/sft_llama3_3b-finemath-4plus-flexible-ordering.00.09-6000_numina-cot-100k_orchard Text Generation • 4B • Updated Jul 1 • 8
CohenQu/sft_llama3_3b-finemath-4plus-flexible-ordering.00.09-8000_numina-cot-100k_orchard Text Generation • 4B • Updated Jul 1 • 4
CohenQu/sft_llama3_3b-finemath-4plus-flexible-ordering.00.09-4000_numina-cot-100k_orchard Text Generation • 4B • Updated Jul 1 • 4
CohenQu/sft_llama3_3b-finemath-4plus-flexible-ordering.00.09-2000_numina-cot-100k_orchard Text Generation • 4B • Updated Jul 1 • 7
CohenQu/sft_llama3_3b-finemath-4plus-flexible-ordering.00.08-6000_numina-cot-100k_babel Text Generation • 4B • Updated Jul 1 • 6
CohenQu/sft_llama3_3b-finemath-4plus-flexible-ordering.00.00_small_cpts-7500_numina-cot-100k_orchard Text Generation • 4B • Updated Jul 1 • 6
CohenQu/sft_llama3_3b-finemath-4plus-flexible-ordering.00.00_small_cpts-8000_numina-cot-100k_babel Text Generation • 4B • Updated Jun 30 • 5
CohenQu/sft_llama3_3b-finemath-4plus-flexible-ordering.00.00_small_cpts-7000_numina-cot-100k_babel Text Generation • 4B • Updated Jun 30 • 5
CohenQu/sft_llama3_3b-finemath-4plus-flexible-ordering.00.00_small_cpts-6500_numina-cot-100k_babel Text Generation • 4B • Updated Jun 29 • 5
CohenQu/sft_llama3_3b-finemath-4plus-flexible-ordering.00.00_small_cpts-6000_numina-cot-100k_babel Text Generation • 4B • Updated Jun 29 • 6
CohenQu/sft_llama3_3b-finemath-4plus-flexible-ordering.00.00_small_cpts-5500_numina-cot-100k_babel Text Generation • 4B • Updated Jun 29 • 5