chitanda/llama2.7b.chat.reclor.lqv2-step-dpo.step.dpo.fix_hack.H100.w4.v1.0.s42 Updated about 24 hours ago
chitanda/llama2.7b.chat.reclor.lqv2-step-dpo.step.dpo.fix_hack.H100.w4.v1.0.s42 Updated about 24 hours ago
chitanda/deepseek-math.7b.ins.meta_math_cot.math55k.n5.critic_correct.dpo.H100.w4.v3.0.s42 Updated 6 days ago
chitanda/deepseek-math.7b.ins.meta_math_cot.math55k.n5.critic_correct.dpo.H100.w4.v3.0.s42 Updated 6 days ago
chitanda/deepseek-math.7b.ins.meta_math_cot.math55k.n5.critic_correct.dpo.H100.w4.v2.0.s42 Updated 7 days ago
chitanda/deepseek-math.7b.ins.meta_math_cot.math55k.n5.critic_correct.dpo.H100.w4.v2.0.s42 Updated 7 days ago
chitanda/deepseek-math.7b.ins.meta_math_cot.math55k.n5.critic_correct.dpo.H100.w4.v3.1.s42 Updated 14 days ago
chitanda/deepseek-math.7b.ins.meta_math_cot.math55k.n5.critic_correct.dpo.H100.w4.v3.1.s42 Updated 14 days ago
Magpie-Align/Magpie-Reasoning-V1-150K-CoT-Deepseek-R1-Llama-70B Viewer • Updated about 1 month ago • 150k • 1.38k • 16
Magpie-Align/Magpie-Reasoning-V2-250K-CoT-Deepseek-R1-Llama-70B Viewer • Updated about 1 month ago • 250k • 6.05k • 84