llm-jp-3-13b-instruct2-grpo-0222_step2000 / model-00001-of-00006.safetensors

Commit History

Trained with Unsloth
e6b0345
verified

morizon commited on