zhang

kekueknu2

AI & ML interests

None yet

Recent Activity

updated a Space 6 months ago

kekueknu2/wewe-rss

kekueknu2's activity

upvoted an article 29 days ago

Article

From Zero to Reasoning Hero: How DeepSeek-R1 Leverages Reinforcement Learning to Master Complex Reasoning

Feb 4 • 12

upvoted an article 6 months ago

Article

Illustrating Reinforcement Learning from Human Feedback (RLHF)

Dec 9, 2022 • 198

upvoted a collection 11 months ago

LLM papers

It is a collection of papers that are useful in studying LLM. • 14 items • Updated Apr 3, 2024 • 12

upvoted a paper 11 months ago

Chain-of-Thought Reasoning Without Prompting

Paper • 2402.10200 • Published Feb 15, 2024 • 105

upvoted 2 collections 11 months ago

Foundation AI Papers

Curated List of Must-Reads on LLM reasoning at Temus AI team • 135 items • Updated Jun 15, 2024 • 32

Reading Papers

227 items • Updated 16 days ago • 10