My first use of GRPO fine-tuning techniques. What I learn from this model will be applied to future Andy models.
Sweaterdog
Recent Activity
Updated a collection 2 days ago: Smol-reason
Published a model 2 days ago: Sweaterdog/Smol-reason2-LoRA
Collections: 2
Models: 17
Sweaterdog/Smol-reason2-LoRA
Sweaterdog/Smol-reason2
Sweaterdog/Smol-Reason • 223
Sweaterdog/Andy-3.6-small-GRPO-LoRA
Sweaterdog/Andy-3.6-small-GRPO
Sweaterdog/Smol-reason-LoRA
Sweaterdog/Andy-3.6-small • 95
Sweaterdog/Andy-3.6 • 803 • 3
Sweaterdog/Andy-3.6-LoRA • 44 • 1
Sweaterdog/Andy-3.6-small-LoRA