Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
Setpember
/
Jon_pythia_DPO_epi_1
like
0
TensorBoard
Safetensors
gpt_neox
trl
dpo
Generated from Trainer
License:
apache-2.0
Model card
Files
Files and versions
Metrics
Training metrics
Community
main
Jon_pythia_DPO_epi_1
/
runs
1 contributor
History:
3 commits
Setpember
End of training
66c068a
verified
4 months ago
Nov18_06-38-04_f8e097f86709
End of training
4 months ago
Nov18_10-10-53_f308ff891443
End of training
4 months ago
Nov18_10-20-29_f308ff891443
End of training
4 months ago
Nov18_10-53-05_b6a526804a78
End of training
4 months ago