Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
nicholasKluge
/
Harmless-RewardModel
like
1
Text Classification
Transformers
Safetensors
nicholasKluge/harmless-aira-dataset
Anthropic/hh-rlhf
English
roberta
reward model
alignment
preference model
RLHF
Carbon Emissions
License:
apache-2.0
Model card
Files
Files and versions
xet
Community
Train
Deploy
Use this model
main
Harmless-RewardModel
/
README.md
Commit History
Update README.md
3cf2a1e
verified
nicholasKluge
commited on
Jun 9
Update README.md
f4f2060
verified
nicholasKluge
commited on
Jun 9
Update README.md
1ba5b15
verified
nicholasKluge
commited on
Jun 18, 2024
Update README.md
fed22d0
verified
nicholasKluge
commited on
May 27, 2024
Update README.md
7a584f3
verified
nicholasKluge
commited on
May 27, 2024
Create README.md
c3ba6f1
verified
nicholasKluge
commited on
May 27, 2024