JaishreeramCoder/reward_model
Tags: Text Classification, Transformers, Safetensors, gemma2, trl, reward-trainer, text-generation-inference, 4-bit precision, bitsandbytes, arxiv:1910.09700
Branch: main (1 contributor, history: 3 commits)
Latest commit: f4064b0 (verified), "Upload tokenizer", by JaishreeramCoder, 5 months ago
File                      Size       Scan  LFS  Last commit                              Updated
.gitattributes            1.57 kB    Safe  -    Upload tokenizer                         5 months ago
README.md                 5.19 kB    Safe  -    Upload Gemma2ForSequenceClassification   5 months ago
config.json               1.66 kB    Safe  -    Upload Gemma2ForSequenceClassification   5 months ago
model.safetensors         2.32 GB    Safe  LFS  Upload Gemma2ForSequenceClassification   5 months ago
special_tokens_map.json   522 Bytes  Safe  -    Upload tokenizer                         5 months ago
tokenizer.json            34.4 MB    Safe  LFS  Upload tokenizer                         5 months ago
tokenizer.model           4.24 MB    Safe  LFS  Upload tokenizer                         5 months ago
tokenizer_config.json     46.4 kB    Safe  -    Upload tokenizer                         5 months ago