zhang's picture
6 1

zhang

kekueknu2
·

AI & ML interests

None yet

Recent Activity

Organizations

san's profile picture san's profile picture

kekueknu2's activity

upvoted an article 29 days ago
view article
Article

From Zero to Reasoning Hero: How DeepSeek-R1 Leverages Reinforcement Learning to Master Complex Reasoning

By NormalUhr •
• 12
upvoted an article 6 months ago
view article
Article

Illustrating Reinforcement Learning from Human Feedback (RLHF)

• 198