Jiang Jiwen

jjw0126
·

AI & ML interests

RL, LLM

Recent Activity

liked a dataset 9 days ago
sanbu/tianji-chinese
liked a dataset 14 days ago
adyen/DABstep
liked a dataset 14 days ago
ibm-granite/GneissWeb
View all activity

Organizations

ucas's profile picture ELM Team's profile picture PLM-Team's profile picture

jjw0126's activity

upvoted 2 articles 22 days ago
view article
Article

Open-R1: a fully open reproduction of DeepSeek-R1

803
view article
Article

DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge

By NormalUhr
70