Vexamologist
vexamologist
AI & ML interests
None yet
Recent Activity
reacted to burtenshaw's post with 👍 12 days ago
Still speedrunning teaching Gemma 3 to think. Today I focused on getting GRPO running on GPU-poor hardware.
This is a plain TRL and PEFT notebook that works on Apple silicon Macs or a Colab T4. It uses the 1B variant of Gemma 3 and a reasoning version of the GSM8K dataset.
🧑‍🍳 There's more still in the oven, like model releases, an Unsloth version, and deeper tutorials, but hopefully this will bootstrap your projects.
Here's a link to the 1B notebook: https://colab.research.google.com/drive/1mwCy5GQb9xJFSuwt2L_We3eKkVbx2qSt?usp=sharing
upvoted a collection 18 days ago
Cantonese Dataset
liked a model 18 days ago
hon9kon9ize/bert-large-cantonese
Organizations
models
None public yet
datasets
None public yet