Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
Ray2333
/
GRM-Llama3.2-3B-rewardmodel-ft
like
7
Text Classification
Safetensors
Skywork/Skywork-Reward-Preference-80K-v0.2
llama
arxiv:
2406.10216
License:
apache-2.0
Model card
Files
Files and versions
Community
2
Train
9ac3e42
GRM-Llama3.2-3B-rewardmodel-ft
Commit History
Upload tokenizer
9ac3e42
verified
Ray2333
commited on
Oct 23, 2024
Upload LlamaForSequenceClassification
0d2e996
verified
Ray2333
commited on
Oct 23, 2024
initial commit
241cf11
verified
Ray2333
commited on
Oct 23, 2024