v1_1000_STEPS_1e5_rate_05_beta_DPO / model-00002-of-00003.safetensors

Commit History

End of training
5b89da6
verified

tsavage68 commited on