selfcorrexp2/llama3_sft_less_corr_rr0k_ep3_train_on_reasoning Text Generation • Updated 3 days ago • 33
selfcorrexp2/llama3_sft_balanced_corr_rr0k_ep3_train_on_reasoning Text Generation • Updated 4 days ago • 15