anakin87/gemma-2b-orpo
Text Generation
Transformers
Safetensors
alvarobartt/dpo-mix-7k-simplified
English
gemma
trl
orpo
Generated from Trainer
conversational
Eval Results
text-generation-inference
Inference Endpoints
arxiv: 2403.07691
License: gemma-terms-of-use
dda322f
gemma-2b-orpo/README.md
Commit History
Adding Evaluation Results
dda322f
verified
leaderboard-pr-bot
committed on
May 6, 2024
link to GGUF version
76e5b9c
verified
anakin87
committed on
Apr 6, 2024
improve readme
1a06bf8
anakin87
committed on
Mar 26, 2024
fix
cd3951b
anakin87
committed on
Mar 25, 2024
fixes
189413b
anakin87
committed on
Mar 25, 2024
improve readme
ce4ba3c
anakin87
committed on
Mar 25, 2024
material
4db7146
anakin87
committed on
Mar 25, 2024
Update README.md
5cbf999
verified
anakin87
committed on
Mar 25, 2024
little change
15a13e0
anakin87
committed on
Mar 25, 2024
End of training
7fbb0bb
verified
anakin87
committed on
Mar 24, 2024