Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
174.4
TFLOPS
3
15
79
rasdani
rasdani
Follow
Jofthomas's profile picture
bjoernp's profile picture
D4ve-R's profile picture
18 followers
·
74 following
rasdani_
rasdani
rasdani
AI & ML interests
None yet
Recent Activity
updated
a dataset
about 1 hour ago
rasdani/SWE-bench_Lite_oracle_32k
published
a dataset
about 1 hour ago
rasdani/SWE-bench_Lite_oracle_32k
liked
a dataset
2 days ago
EssentialAI/essential-web-v1.0
View all activity
Organizations
rasdani
's models
27
Sort: Recently updated
rasdani/git-diff-Qwen-4B
Updated
3 days ago
•
13
rasdani/git-diff-Qwen-1.7B
Updated
4 days ago
•
12
rasdani/git-diff-Qwen-1.7-B
Updated
4 days ago
•
14
rasdani/simple-math-Qwen-1.5B
Updated
5 days ago
•
5
rasdani/qwen3_0_6b_function_rm
Updated
29 days ago
•
25
rasdani/Qwen2.5-0.5B-simpleRL-Zoo-8192k
Updated
Apr 8
•
13
rasdani/Qwen2.5-0.5B-simpleRL-Zoo
Text Generation
•
Updated
Apr 6
•
7
rasdani/smolR1-Qwen2.5-0.5B
Text Generation
•
Updated
Mar 31
•
10
rasdani/Qwen2.5-0.5B-simpleRL-Zoo-no-KL
Updated
Mar 31
rasdani/Qwen2.5-0.5B-simpleRL-Zoo-3072k
Updated
Mar 31
rasdani/Qwen2.5-0.5B-simpleRL-Zoo-4096k
Updated
Mar 31
rasdani/Qwen2.5-0.5B-simpleRL-Zoo-2560k
Updated
Mar 31
rasdani/Qwen2.5-0.5B-simpleRL-Zoo-2048k
Updated
Mar 31
rasdani/Qwen2.5-0.5B-simpleRL-Zoo-first-try
Updated
Mar 29
•
13
rasdani/Qwen-1.5B-Distill-GRPO
Text Generation
•
Updated
Mar 28
•
10
rasdani/Qwen-0.5B-Instruct-GRPO
Updated
Mar 27
rasdani/gsm8k_qwen2.5-0.5b
Updated
Mar 11
•
9
rasdani/Qwen2.5-1.5B-Open-R1-Code-GRPO
Updated
Mar 9
rasdani/Qwen2.5-0.5B-Open-R1-Code-GRPO
Text Generation
•
Updated
Mar 8
•
11
rasdani/Qwen2.5-7B-Instruct-GRPO-unsloth
Text Generation
•
Updated
Mar 2
•
15
rasdani/Qwen2.5-3B-Instruct-GRPO-unsloth
Text Generation
•
Updated
Mar 1
•
13
rasdani/Qwen-7B-Instruct-GRPO-unsloth
Text Generation
•
Updated
Feb 27
•
10
rasdani/meta-Llama-3.1-8B-Instruct-GRPO-unsloth
Text Generation
•
Updated
Feb 26
•
10
rasdani/Qwen2.5-1.5B-Instruct-GRPO-rg
Text Generation
•
Updated
Feb 25
•
10
rasdani/meta-Llama-3.1-8B-Instruct-GRPO-rg
Updated
Feb 24
•
140
rasdani/qwen2-math-7b-step-dpo
Text Generation
•
Updated
Aug 29, 2024
•
15
rasdani/qwen2-math-1_5b-step-dpo
Text Generation
•
Updated
Aug 28, 2024
•
18