myselfrew/llama3_self_gen_n140_filter_3e6_bz32_packing_8192_2epoch Text Generation • Updated 9 days ago • 11
myselfrew/llama3_self_gen_n140_filter_3e6_bz32_packing_8192_3epoch Text Generation • Updated 9 days ago • 11
myselfrew/llama3_self_gen_n40_filter_2e6_bz128_no_packing_plus_train_on_correct_2epoch Text Generation • Updated 11 days ago • 33
myselfrew/llama3_8b_selfgenn40_2e6_bz128_nopacking_also_train_reward_2epoch Text Generation • Updated 11 days ago • 27
myselfrew/llama3_8b_learn_from_70b_data_n4_filter_2e6_bz32_pack8192_also_train_reward_3epoch Text Generation • Updated 11 days ago • 9
myselfrew/llama3_8b_learn_from_70b_data_n4_filter_2e6_bz32_pack8192_also_train_reward_2epoch Text Generation • Updated 11 days ago • 11
myselfrew/llama3_8b_math_new_prompt_filtered_no_self_correction_sft Viewer • Updated 7 days ago • 315k • 85
myselfrew/llama31_8b_math_new_prompt_filtered_no_self_correction_sft Viewer • Updated 8 days ago • 375k • 39