ernie-research/HH-RLHF-Gemma-7B-MA-PPO-Fixed5
Safetensors
Dahoas/full-hh-rlhf
gemma
arxiv: 2410.02743
License: mit
1 contributor
History: 1 commit
Moyu-hrsun, initial commit, 93e3378 (verified), 9 months ago
.gitattributes, Safe, 1.52 kB, initial commit, 9 months ago