llm-jp-3-13b-instruct2-grpo-0222_step1000 / model-00002-of-00006.safetensors

Commit History

Trained with Unsloth
7907efb
verified

morizon commited on