GRPO RL model
SunJack
SunJack
·
AI & ML interests
None yet
Recent Activity
updated
a collection
21 days ago
GRPO
updated
a model
21 days ago
SunJack/Qwen2.5-3B-R1-GGUF
updated
a model
21 days ago
SunJack/Qwen2.5-3B-R1
Organizations
Collections
1
models
14

SunJack/Qwen2.5-3B-R1-GGUF
Updated
•
139

SunJack/Qwen2.5-3B-R1
Updated
•
46

SunJack/Phi-4-R1
Updated

SunJack/Phi-4-R1-GGUF
Updated

SunJack/Qwen2.5-7b-sft
Updated
•
19

SunJack/phi4-o1
Updated
•
130

SunJack/Qwen2.5-3B-GRPO_lora
Updated

SunJack/qwen2.5-7b-o1
Updated
•
47
•
1

SunJack/qwen2.5-7b-cve
Updated
•
33
•
1

SunJack/qwen2-7b-ruozhiba-finetuning
Updated
•
67
•
2