Rykov Elisei

lmeribal

AI & ML interests

NLP, Multimodality

Organizations

s-nlp

lmeribal's activity

upvoted an article 7 days ago

DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge

By NormalUhr
66