Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
claudiubarbu
/
reward
like
0
Text Classification
Transformers
Safetensors
piqa
gpt2
trl
reward-trainer
Generated from Trainer
text-generation-inference
Inference Endpoints
License:
mit
Model card
Files
Files and versions
Community
Train
Deploy
Use this model
1f8e018
reward
Commit History
Training in progress, step 500
1f8e018
verified
claudiubarbu
commited on
Sep 10
HW2-reward
e5ff3b5
verified
claudiubarbu
commited on
Sep 10
Training in progress, step 6000
ee6ea51
verified
claudiubarbu
commited on
Sep 10
Training in progress, step 5500
db973e5
verified
claudiubarbu
commited on
Sep 10
Training in progress, step 5000
9e50856
verified
claudiubarbu
commited on
Sep 10
Training in progress, step 4500
4f66dea
verified
claudiubarbu
commited on
Sep 10
Training in progress, step 4000
1239ce3
verified
claudiubarbu
commited on
Sep 10
Training in progress, step 3500
0f68f9e
verified
claudiubarbu
commited on
Sep 10
Training in progress, step 3000
1201958
verified
claudiubarbu
commited on
Sep 10
Training in progress, step 2500
d299e43
verified
claudiubarbu
commited on
Sep 10
Training in progress, step 2000
56e7f0b
verified
claudiubarbu
commited on
Sep 10
Training in progress, step 1500
f6de157
verified
claudiubarbu
commited on
Sep 10
Training in progress, step 1000
9f9e765
verified
claudiubarbu
commited on
Sep 10
Training in progress, step 500
21f9460
verified
claudiubarbu
commited on
Sep 10
Training in progress, step 21
b91e0ea
verified
claudiubarbu
commited on
Sep 1
HW2-reward
61067fc
verified
claudiubarbu
commited on
Aug 30
Training in progress, step 39
525b61b
verified
claudiubarbu
commited on
Aug 30
initial commit
6edb124
verified
claudiubarbu
commited on
Aug 30