Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
claudiubarbu
/
reward
like
0
Text Classification
Transformers
Safetensors
piqa
gpt2
trl
reward-trainer
Generated from Trainer
text-generation-inference
Inference Endpoints
License:
mit
Model card
Files
Files and versions
Community
Train
Deploy
Use this model
main
reward
/
README.md
Commit History
HW2-reward
572f948
verified
claudiubarbu
commited on
Sep 10
Training in progress, step 500
1f8e018
verified
claudiubarbu
commited on
Sep 10
HW2-reward
e5ff3b5
verified
claudiubarbu
commited on
Sep 10
HW2-reward
61067fc
verified
claudiubarbu
commited on
Aug 30