Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
gabrielloiseau
/
TAROT-PPO
like
1
Text Generation
Transformers
Safetensors
Yelp/yelp_review_full
English
gpt2
ppo
text-generation-inference
Inference Endpoints
arxiv:
2407.21630
License:
gpl-3.0
Model card
Files
Files and versions
Community
Train
Deploy
Use this model
5806213
TAROT-PPO
Commit History
Update README.md
5806213
verified
gabrielloiseau
commited on
Jul 30, 2024
Update README.md
7c8d472
verified
gabrielloiseau
commited on
Jul 30, 2024
Update README.md
91f2e7b
verified
gabrielloiseau
commited on
Jul 30, 2024
Update README.md
45f9f3a
verified
gabrielloiseau
commited on
Jul 30, 2024
Update README.md
94b99e8
verified
gabrielloiseau
commited on
Jul 29, 2024
Upload tokenizer
dad7071
verified
gabrielloiseau
commited on
Jul 25, 2024
Upload model
0e925f9
verified
gabrielloiseau
commited on
Jul 25, 2024
initial commit
f4e83ed
verified
gabrielloiseau
commited on
Jul 25, 2024