Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
ZefanW
/
dpo-baseline
like
0
Text Generation
Transformers
Safetensors
llama
trl
Generated from Trainer
conversational
text-generation-inference
Inference Endpoints
Model card
Files
Files and versions
Community
Train
Deploy
Use this model
main
dpo-baseline
/
model-00004-of-00004.safetensors
Commit History
Model save
baaa74d
verified
ZefanW
commited on
20 days ago
Training in progress, step 416
9a940ab
verified
ZefanW
commited on
20 days ago
Training in progress, step 384
3055239
verified
ZefanW
commited on
20 days ago
Training in progress, step 352
02cbf30
verified
ZefanW
commited on
20 days ago
Training in progress, step 320
87a6e89
verified
ZefanW
commited on
20 days ago
Training in progress, step 288
452bce6
verified
ZefanW
commited on
20 days ago
Training in progress, step 256
1e025f4
verified
ZefanW
commited on
20 days ago
Training in progress, step 224
29b01f7
verified
ZefanW
commited on
20 days ago
Training in progress, step 192
4c23551
verified
ZefanW
commited on
20 days ago
Training in progress, step 160
13c9dd8
verified
ZefanW
commited on
20 days ago
Training in progress, step 128
c7748c3
verified
ZefanW
commited on
20 days ago
Training in progress, step 96
b538b99
verified
ZefanW
commited on
20 days ago
Training in progress, step 64
bca052e
verified
ZefanW
commited on
20 days ago
Training in progress, step 32
1b3e144
verified
ZefanW
commited on
20 days ago