selfcorrexp2/llama3_sft_less_corr_train_on_corr_dpo_gen2_augmath Viewer • Updated 1 day ago • 19.9k • 2
selfcorrexp2/llama3_sft_less_corr_train_on_corr_dpo_gen1_math_2nd_round_prompt Viewer • Updated 1 day ago • 18.7k • 3
selfcorrexp2/llama3_sft_less_corr_train_on_corr_dpo_gen1_augmath_2nd_round_prompt Viewer • Updated 1 day ago • 19.9k • 3
selfcorrexp2/llama3_sft_less_corr_train_on_corr_dpo_gen1_augmath Viewer • Updated 1 day ago • 7.57k • 4
selfcorrexp2/llama3_sft_less_corr_rr0k_ep3_train_on_reasoning Text Generation • Updated 3 days ago • 33