Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
Setpember
/
Jon_pythia_DPO_epi_1
like
0
TensorBoard
Safetensors
gpt_neox
trl
dpo
Generated from Trainer
License:
apache-2.0
Model card
Files
Files and versions
Metrics
Training metrics
Community
main
Jon_pythia_DPO_epi_1
/
training_args.bin
Commit History
End of training
66c068a
verified
Setpember
commited on
Nov 18, 2024
End of training
f02c9aa
verified
Setpember
commited on
Nov 18, 2024
End of training
7797583
verified
Setpember
commited on
Nov 18, 2024