Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
myradeng
/
dpo-mistral-math-v4
like
0
Transformers
Safetensors
Generated from Trainer
trl
dpo
Inference Endpoints
arxiv:
2305.18290
Model card
Files
Files and versions
Community
Train
Deploy
Use this model
main
dpo-mistral-math-v4
Commit History
Model save
2453f05
verified
myradeng
commited on
Nov 29, 2024
Training in progress, step 2857
ee8eeab
verified
myradeng
commited on
Nov 29, 2024
Training in progress, step 2500
8244d46
verified
myradeng
commited on
Nov 28, 2024
Training in progress, step 2000
e2bbb36
verified
myradeng
commited on
Nov 28, 2024
Training in progress, step 1500
1233877
verified
myradeng
commited on
Nov 28, 2024
Training in progress, step 1000
c505004
verified
myradeng
commited on
Nov 28, 2024
Training in progress, step 500
1a2a049
verified
myradeng
commited on
Nov 27, 2024
initial commit
09b8022
verified
myradeng
commited on
Nov 27, 2024