Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
dball
/
zephyr-7b-dpo-qlora
like
0
PEFT
TensorBoard
Safetensors
HuggingFaceH4/ultrafeedback_binarized
mistral
alignment-handbook
Generated from Trainer
trl
dpo
License:
apache-2.0
Model card
Files
Files and versions
Metrics
Training metrics
Community
2
Train
Use this model
4953c55
zephyr-7b-dpo-qlora
Commit History
Training in progress, step 1300
42251be
verified
dball
commited on
Jan 23, 2024
Training in progress, step 1200
d4a1fc9
verified
dball
commited on
Jan 23, 2024
Training in progress, step 900
8799f0a
verified
dball
commited on
Jan 23, 2024
Training in progress, step 700
5858fc3
verified
dball
commited on
Jan 23, 2024
Training in progress, step 600
e2fa348
verified
dball
commited on
Jan 23, 2024
Training in progress, step 500
9b0f584
verified
dball
commited on
Jan 23, 2024
Training in progress, step 400
0c99ab6
verified
dball
commited on
Jan 23, 2024
Training in progress, step 300
dc31bf4
verified
dball
commited on
Jan 23, 2024
Training in progress, step 200
cf81ac7
verified
dball
commited on
Jan 23, 2024
initial commit
509eca3
verified
dball
commited on
Jan 23, 2024
Previous
1
2
Next