Qwen2.5-7B-sft-hhrlhf-gen-dpo / model.safetensors.index.json

Commit History

Training in progress, epoch 1
38df07f
verified

AmberYifan commited on