dame rajee's picture

dame rajee

damerajee

AI & ML interests

None yet

Recent Activity

View all activity

Organizations

Blog-explorers's profile picture Samanvay AI's profile picture None yet's profile picture

damerajee's activity

reacted to lewtun's post with šŸ”„šŸ¤—šŸš€ about 9 hours ago
view post
Post
4168
We are reproducing the full DeepSeek R1 data and training pipeline so everybody can use their recipe. Instead of doing it in secret we can do it together in the open!

šŸ§Ŗ Step 1: replicate the R1-Distill models by distilling a high-quality reasoning corpus from DeepSeek-R1.

šŸ§  Step 2: replicate the pure RL pipeline that DeepSeek used to create R1-Zero. This will involve curating new, large-scale datasets for math, reasoning, and code.

šŸ”„ Step 3: show we can go from base model -> SFT -> RL via multi-stage training.

Follow along: https://github.com/huggingface/open-r1
  • 1 reply
Ā·
upvoted an article 2 days ago
view article
Article

**Topic 24: What is Cosmos World Foundation Model Platform?**

By Kseniase ā€¢
ā€¢ 6
upvoted an article 10 days ago
view article
Article

Timm ā¤ļø Transformers: Use any timm model with transformers

ā€¢ 34