Llama-3.1-8B-sft-hhrlhf-dpo / last-checkpoint
AmberYifan's picture
Training in progress, epoch 3, checkpoint
c60270c verified