kaiwenw/nov22_lr_3e-6_lora_32_dropout_0.1_all_reject_first_ep_4 Text Generation • Updated Dec 7, 2024 • 20
kaiwenw/nov22_lr_3e-6_lora_32_dropout_0.1_all_reject_first_ep_3 Text Generation • Updated Dec 7, 2024 • 21
kaiwenw/nov22_lr_3e-6_lora_32_dropout_0.1_all_reject_first_ep_2 Text Generation • Updated Dec 7, 2024 • 17
kaiwenw/nov22_lr_3e-6_lora_32_dropout_0.1_all_reject_first_ep_1 Text Generation • Updated Dec 7, 2024 • 39
kaiwenw/distill-r1-qwen-1.5b-hmmt-feb-25-4096-with-bt-model-with-sigmoid Viewer • Updated May 6 • 123k • 20
kaiwenw/distill-r1-qwen-1.5b-hmmt-feb-24-4096-with-bt-model-with-sigmoid Viewer • Updated May 6 • 123k • 20
kaiwenw/distill-r1-qwen-1.5b-aime-25-4096-with-bt-model-with-sigmoid Viewer • Updated May 6 • 123k • 20
kaiwenw/distill-r1-qwen-1.5b-aime-24-4096-with-bt-model-with-sigmoid Viewer • Updated May 6 • 123k • 11
kaiwenw/distill-r1-qwen-1.5b-hmmt-feb-25-4096-with-bt-model-wout-sigmoid Viewer • Updated May 6 • 123k • 62
kaiwenw/distill-r1-qwen-1.5b-hmmt-feb-24-4096-with-bt-model-wout-sigmoid Viewer • Updated May 6 • 123k • 10
kaiwenw/distill-r1-qwen-1.5b-aime-25-4096-with-bt-model-wout-sigmoid Viewer • Updated May 6 • 123k • 77
kaiwenw/distill-r1-qwen-1.5b-aime-24-4096-with-bt-model-wout-sigmoid Viewer • Updated May 6 • 123k • 1.48k
kaiwenw/distill-r1-qwen-1.5b-hmmt-feb-25-4096-with-old-prm-indices_61440_69120 Viewer • Updated May 6 • 7.68k • 26
kaiwenw/distill-r1-qwen-1.5b-hmmt-feb-25-4096-with-old-prm-indices_76800_84480 Viewer • Updated May 6 • 7.68k • 26