RLHF-PPO-PPOModel-LLama3-1B-v1.3 / generation_config.json
bikalnetomi's picture
End of training
6677d7a verified
raw
history blame contribute delete
124 Bytes
{
"bos_token_id": 128000,
"do_sample": true,
"temperature": 0.6,
"top_p": 0.9,
"transformers_version": "4.46.3"
}