ubermenchh/llama3.1-8B-gsm8k-grpo


---

license: mit
tags:
- unsloth
- trl
- grpo
---