jwang2373/updated_qwen2.5_code_1.5b_grpo_iter0_full_data_miao_0212__self_correction_iter1_v2 Viewer • Updated Feb 19 • 29.3k • 91
jwang2373/updated_qwen2.5_code_1.5b_grpo_iter0_full_data_miao_0212__self_correction_iter1_v1 Viewer • Updated Feb 19 • 29.3k • 74
jwang2373/dataset_qwen2.5_code_1.5b_grpo_iter0_full_data_miao_0212_2_global_step_70filtered Viewer • Updated Feb 17 • 29.3k • 100
jwang2373/dataset_qwen2.5_code_1.5b_grpo_iter0_full_data_miao_0212_2_global_step_70 Viewer • Updated Feb 17 • 118k • 76