Yihua Zhang

NormalUhr

AI & ML interests

None yet

Recent Activity

Organizations

OPTML Group @ MSU

NormalUhr's activity

published an article 3 days ago

DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge

By NormalUhr • 12
published an article 6 days ago

A Review on the Evolvement of Load Balancing Strategy in MoE LLMs: Pitfalls and Lessons

By NormalUhr • 2
published an article 6 days ago

From Zero to Reasoning Hero: How DeepSeek-R1 Leverages Reinforcement Learning to Master Complex Reasoning

By NormalUhr • 4
published an article 6 days ago

MLA: Redefining KV-Cache Through Low-Rank Projections and On-Demand Decompression

By NormalUhr • 3
upvoted an article 5 months ago

Optimizing your LLM in production

• 15
New activity in OPTML-Group/UnlearnCanvas 8 months ago