kaiwenw/nov22_lr_3e-6_lora_32_dropout_0.1_all_reject_first_ep_4 Text Generation • Updated Dec 7, 2024 • 1
kaiwenw/nov22_lr_3e-6_lora_32_dropout_0.1_all_reject_first_ep_3 Text Generation • Updated Dec 7, 2024 • 1
kaiwenw/nov22_lr_3e-6_lora_32_dropout_0.1_all_reject_first_ep_2 Text Generation • Updated Dec 7, 2024 • 1
kaiwenw/nov22_lr_3e-6_lora_32_dropout_0.1_all_reject_first_ep_1 Text Generation • Updated Dec 7, 2024 • 1
kaiwenw/open_r1_apr9_DeepSeek_R1_Distill_Qwen_1.5B_tokenized Viewer • Updated 2 days ago • 49.4k • 25
kaiwenw/open_r1_mar2_DeepSeek_R1_Distill_Qwen_1.5B_tokenized Viewer • Updated 9 days ago • 49.5k • 105
kaiwenw/open_r1_mar2_DeepSeek_R1_Distill_Qwen_32B_tokenized Viewer • Updated 10 days ago • 49.5k • 32