Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
jkazdan
/
synthetic-dpo-gemma-2b-helpsteer2
like
0
Safetensors
gemma2
trl
dpo
Generated from Trainer
License:
gemma
Model card
Files
Files and versions
Community
main
synthetic-dpo-gemma-2b-helpsteer2
Commit History
jkazdan/aligned-model-dpo-gemma-2-2b-helpsteer2
9ddc325
verified
jkazdan
commited on
Oct 14, 2024
jkazdan/synthetic-dpo-gemma-2-2b-helpsteer2
e825398
verified
jkazdan
commited on
Oct 13, 2024
initial commit
f67b0df
verified
jkazdan
commited on
Oct 13, 2024