Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
minionKP
/
reward_model_output
like
0
PEFT
Safetensors
llama
trl
reward-trainer
Generated from Trainer
License:
llama3
Model card
Files
Files and versions
Community
Train
Use this model
main
reward_model_output
/
adapter_config.json
Commit History
End of training
5f7b803
verified
minionKP
commited on
Aug 27
End of training
f67d41b
verified
minionKP
commited on
Aug 27
End of training
76b3da1
verified
minionKP
commited on
Aug 26