Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
wirthdrew1
/
zephyr-7b-dpo-qlora
like
1
PEFT
TensorBoard
Safetensors
HuggingFaceH4/ultrafeedback_binarized
mistral
alignment-handbook
Generated from Trainer
trl
dpo
License:
apache-2.0
Model card
Files
Files and versions
Metrics
Training metrics
Community
Train
Use this model
main
zephyr-7b-dpo-qlora
/
runs
/
Jan19_20-14-26_wirandre-work-gpu-1
/
events.out.tfevents.1705695351.wirandre-work-gpu-1.3520.0
Commit History
Training in progress, step 2700
2f6d7f4
verified
wirthdrew1
commited on
Jan 20
Training in progress, step 2600
e5c9202
verified
wirthdrew1
commited on
Jan 20
Training in progress, step 2500
3731962
verified
wirthdrew1
commited on
Jan 20
Training in progress, step 2400
98b85f2
verified
wirthdrew1
commited on
Jan 20
Training in progress, step 2300
bdfe03e
verified
wirthdrew1
commited on
Jan 20
Training in progress, step 2200
bf7efb4
verified
wirthdrew1
commited on
Jan 20
Training in progress, step 2100
cbeb079
verified
wirthdrew1
commited on
Jan 19
Training in progress, step 2000
07e214a
verified
wirthdrew1
commited on
Jan 19
Training in progress, step 1900
0b50296
verified
wirthdrew1
commited on
Jan 19
Training in progress, step 1800
65e75d6
verified
wirthdrew1
commited on
Jan 19
Training in progress, step 1700
9134be7
verified
wirthdrew1
commited on
Jan 19
Previous
1
2
Next