Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
Ray2333
/
GRM-Llama3-8B-rewardmodel-ft
like
1
Safetensors
Skywork/Skywork-Reward-Preference-80K-v0.1
llama
arxiv:
2406.10216
License:
mit
Model card
Files
Files and versions
Community
Train
f993d6a
GRM-Llama3-8B-rewardmodel-ft
Commit History
Update config.json
f993d6a
verified
Ray2333
commited on
Sep 17, 2024
Update config.json
f3c759f
verified
Ray2333
commited on
Sep 17, 2024
Upload tokenizer
1cf8806
verified
Ray2333
commited on
Sep 17, 2024
Upload LlamaForSequenceClassification
cba986c
verified
Ray2333
commited on
Sep 17, 2024
initial commit
8e68bb7
verified
Ray2333
commited on
Sep 17, 2024