selfcorrexp/llama3_non_delete_rr40k_3ep_dpo_gen_augmath_1_type3_cut Viewer • Updated Jan 11 • 5.12k • 9
selfcorrexp/llama3_non_delete_rr40k_3ep_dpo_gen_augmath_1_type4_cut Viewer • Updated Jan 11 • 5.49k • 9
selfcorrexp/llama3_non_delete_rr40k_3ep_dpo_gen_math_1_type4_cut Viewer • Updated Jan 11 • 4.66k • 10
selfcorrexp/type1_and_halftype2_halftype3_and_halftype4_separate_pr Viewer • Updated Jan 11 • 28.9k • 4
selfcorrexp/llama3_non_delete_rr40k_3ep_dpo_gen_augmath_1_type1_1pair_per_idx Viewer • Updated Jan 9 • 5.49k • 8