AI & ML interests

Reinforcement Learning, Large Language Models, Value Alignment

Recent Activity

XuyaoWang  published a dataset about 17 hours ago
PKU-Alignment/s1-m_beta
XuyaoWang  updated a dataset about 17 hours ago
PKU-Alignment/s1-m_beta
XuyaoWang  updated a model about 17 hours ago
PKU-Alignment/s1-m_7b_beta
View all activity

PKU-Alignment's activity