loyal-piano-m7-cdpo / README.md
chargoddard's picture
Create README.md
1f88a5f
|
raw
history blame
161 Bytes
metadata
license: cc-by-nc-4.0
datasets:
  - HuggingFaceH4/ultrafeedback_binarized

Trained for one epoch on ultrafeedback_binarized using cDPO. Evaluation pending.