Zhiwei He
zwhe99
AI & ML interests
Natural Language Processing
Recent Activity
Updated a model 4 days ago: zwhe99/DeepSeek-R1-Distill-Qwen-1.5B
Published a model 4 days ago: zwhe99/DeepSeek-R1-Distill-Qwen-1.5B
Upvoted a paper 20 days ago: Do NOT Think That Much for 2+3=? On the Overthinking of o1-Like LLMs
Organizations
None yet