Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
wengnews
/
tuning_llama_rl_checkpointsstep_9
like
0
Reinforcement Learning
Transformers
PyTorch
trl
Inference Endpoints
License:
apache-2.0
Model card
Files
Files and versions
Community
Train
Deploy
Use this model
93315b9
tuning_llama_rl_checkpointsstep_9
Commit History
initial commit
93315b9
wengnews
commited on
May 11, 2023