Qwen-2.5-3b-instruct-GRPO-250 / model-00002-of-00002.safetensors

Commit History

Trained with Unsloth
e5248e1
verified

BraylonDash committed on

Trained with Unsloth
f879bab
verified

BraylonDash committed on