arxiv:2410.01044
Jiang
Dongwei
AI & ML interests
None yet
Recent Activity
Updated a model about 13 hours ago: Dongwei/Qwen-2.5-7B_Base_Math_smallestlr_newdata
Updated a model about 13 hours ago: Dongwei/Qwen-2.5-7B_Base_Math_smallestlr
Published a model about 16 hours ago: Dongwei/Qwen-2.5-7B_Base_Math_smallestlr_newdata
Organizations
Papers: 3
Models: 15
Dongwei/Qwen-2.5-7B_Base_Math_smallestlr_newdata • Text Generation
Dongwei/Qwen-2.5-7B_Base_Math_smallestlr • Text Generation
Dongwei/Qwen-2.5-7B_Base_Math_smalllr • Text Generation • 10
Dongwei/DeepSeek-R1-Distill-Qwen-7B-GRPO_Math_lowlr • Text Generation • 4
Dongwei/DeepSeek-R1-Distill-Qwen-1.5B-GRPO_Math_smalllr • Text Generation • 4
Dongwei/Qwen2.5-1.5B-Open-R1-GRPO_Math_smalllr • Text Generation • 6
Dongwei/Qwen-2.5-7B_Math_smalllr • Text Generation • 4
Dongwei/DeepSeek-R1-Distill-Qwen-7B-GRPO_Math • Text Generation • 14
Dongwei/Qwen-2.5-7B_Math • Text Generation • 18
Dongwei/DeepSeek-R1-Distill-Qwen-1.5B-GRPO_Math • Text Generation • 14