Aidan Do
aidando73
0 followers · 2 following
AI & ML interests: None yet
Organizations: None yet
aidando73's models (37)
Sort: Recently updated
aidando73/simplerl-v8-checkpoints • Updated Apr 6
aidando73/simplerl-Qwen2.5-Math-7B-v5-checkpoint40 • 8B • Updated Apr 4 • 9
aidando73/simplerl-v5-checkpoints • Updated Apr 4
aidando73/simplerl-v6-checkpoints • Updated Apr 4
aidando73/simplerl-v4-checkpoints • Updated Apr 2
aidando73/simplerl-single-grpo-v1-checkpoints • Updated Mar 29
aidando73/Qwen-2.5-7B-Simple-RL-v9 • Text Generation • 8B • Updated Mar 24 • 7
aidando73/Qwen-2.5-7B-Simple-RL-v8 • Text Generation • 8B • Updated Mar 24 • 11
aidando73/Qwen-2.5-7B-Simple-RL-v7 • Text Generation • 8B • Updated Mar 23 • 13
aidando73/Qwen-2.5-7B-Simple-RL-v6 • Text Generation • 8B • Updated Mar 23 • 14
aidando73/Qwen-2.5-7B-Simple-RL-v5 • Text Generation • 8B • Updated Mar 23 • 11
aidando73/Qwen-2.5-7B-Simple-RL-v4 • Updated Mar 22
aidando73/Qwen-2.5-7B-Simple-RL-v3 • Text Generation • 8B • Updated Mar 22 • 9
aidando73/Qwen-2.5-7B-Simple-RL-v2 • Text Generation • 8B • Updated Mar 22 • 13
aidando73/Qwen-2.5-7B-Simple-RL-v1 • Text Generation • 8B • Updated Mar 21 • 12
aidando73/grpo-big-math-rl-v2 • Updated Mar 21
aidando73/llama-3.1-8b-grpo-big-math-rl-v3-checkpoints • Updated Mar 20
aidando73/llama-3.1-8b-grpo-big-math-rl-v2-checkpoints • Updated Mar 20
aidando73/Qwen2-0.5B-GRPO-summarize-2025-03-17-20750 • Text Generation • 0.5B • Updated Mar 18 • 20
aidando73/Qwen2-0.5B-summarize-SFT-2025-03-17-43773 • Text Generation • 0.5B • Updated Mar 17 • 14
aidando73/Qwen2-0.5B-GRPO-20750 • Text Generation • 0.5B • Updated Mar 17 • 22
aidando73/llama-3.1-8b-grpo-checkpoints • Updated Mar 17
aidando73/Qwen2-0.5B-summarize-SFT-2025-03-17 • Updated Mar 17
aidando73/llama-3.1-8b-grpo-33000-merged • Text Generation • 8B • Updated Mar 17 • 9
aidando73/Qwen2-0.5B-GRPO-8250 • Text Generation • 0.5B • Updated Mar 17 • 17
aidando73/llama-3.1-8b-grpo-19500-merged • Text Generation • 8B • Updated Mar 16 • 48
aidando73/llama-3.1-8b-grpo-10500-merged • Text Generation • 8B • Updated Mar 15 • 11
aidando73/llama-3.1-8b-4bit-merged • Text Generation • 8B • Updated Mar 15 • 11
aidando73/qwen2.5-3b-4bit-merged • Text Generation • 3B • Updated Mar 15 • 7
aidando73/llama-3.1-8b-grpo-4bit-merged • Text Generation • 8B • Updated Mar 15 • 8