nike00811/dpo-full-0715
Text Generation
Transformers
Safetensors
mistral
trl
dpo
Generated from Trainer
conversational
text-generation-inference
Inference Endpoints
Commit History (branch: main)
Model save (6240a73, verified)
nike00811 committed on Jul 15
Training in progress, step 27 (7bb2416, verified)
nike00811 committed on Jul 15
initial commit (30ec433, verified)
nike00811 committed on Jul 15