rlhflow-llama-3-sft-8b-v2-token-ppo-60k / model-00004-of-00004.safetensors

Commit History

Upload LlamaForCausalLM
9fec59f
verified

yyqoni committed on