Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
anakin87
/
gemma-2b-orpo
like
28
Text Generation
Transformers
Safetensors
alvarobartt/dpo-mix-7k-simplified
English
gemma
trl
orpo
Generated from Trainer
conversational
Eval Results
text-generation-inference
arxiv:
2403.07691
License:
gemma-terms-of-use
Model card
Files
Files and versions
xet
Community
2
Train
Deploy
Use this model
New discussion
New pull request
Resources
PR & discussions documentation
Code of Conduct
Hub documentation
All
Discussions
Pull requests
View closed (2)
Sort: Recently created
Welcome to the community
The community tab is the place to discuss and collaborate with the HF community!