mlfoundations-dev/openthoughts3_100k_qwen25_1b_bsz256_lr2e5_epochs7 Text Generation • 2B • Updated Jun 24 • 5
mlfoundations-dev/openthoughts3_100k_qwen25_1b_bsz256_lr4e5_epochs7 Text Generation • 2B • Updated Jun 24 • 5
mlfoundations-dev/openthoughts3_100k_qwen25_1b_bsz256_lr4e5_epochs5 Text Generation • 2B • Updated Jun 24 • 5
mlfoundations-dev/openthoughts3_100k_qwen25_1b_bsz512_lr2e5_epochs5 Text Generation • 2B • Updated Jun 24 • 5
mlfoundations-dev/Qwen2.5-7B-Instruct_openthoughts3_300k_annotated_Qwen3-32B Text Generation • 8B • Updated Jun 23 • 5 • 1
mlfoundations-dev/DeepSeek-R1-Distill-Qwen-7B_OpenThoughts3 Text Generation • 8B • Updated Jun 23 • 4
mlfoundations-dev/DeepSeek-R1-Distill-Qwen-1.5B_OpenThoughts3 Text Generation • 2B • Updated Jun 18 • 6
mlfoundations-dev/QwQ-32B_enable-liger-kernel_False_OpenThoughts3_10k Text Generation • 33B • Updated Jun 17 • 6
mlfoundations-dev/QwQ-32B_enable-liger-kernel_False_OpenThoughts3_3k Text Generation • 33B • Updated Jun 17 • 6
mlfoundations-dev/QwQ-32B_enable-liger-kernel_False_OpenThoughts3_1k Text Generation • 33B • Updated Jun 16 • 6
mlfoundations-dev/Qwen2.5-7B-Instruct_openthoughts3_math_100k_annotated_QwQ-32B Text Generation • 8B • Updated Jun 16 • 6
mlfoundations-dev/qwen_lawma_deepseek-2k-5x-majority_verified Text Generation • 8B • Updated May 27 • 9