DrishtiSharma/doplhin-2.1-mistral-7b-orpo-ultrafeedback-binarized-preferences
Tags: PEFT, TensorBoard, Safetensors, trl, orpo, Generated from Trainer
License: apache-2.0
Commit History
File: adapter_model.safetensors (branch: main)
End of training | 6a2a7ff | verified | DrishtiSharma committed on Apr 12
Training in progress, step 2500 | 96a008c | verified | DrishtiSharma committed on Apr 12
Training in progress, step 2000 | b5b03d5 | verified | DrishtiSharma committed on Apr 12
Training in progress, step 1500 | 51fd869 | verified | DrishtiSharma committed on Apr 12
Training in progress, step 1000 | f0ad00b | verified | DrishtiSharma committed on Apr 12
Training in progress, step 500 | ec18cca | verified | DrishtiSharma committed on Apr 12