dixiang

didiqwq

AI & ML interests

None yet

Organizations

None yet

didiqwq's activity

upvoted an article 15 days ago

DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge

By NormalUhr
44
New activity in MMInstruction/M3IT over 1 year ago
New activity in liuhaotian/LLaVA-CC3M-Pretrain-595K almost 2 years ago

Question about download

6
#1 opened almost 2 years ago by didiqwq