Zhaolin Gao

GitBag

AI & ML interests

Reinforcement Learning from Human Feedback

Recent Activity

updated a dataset about 1 hour ago
GitBag/1744064489
updated a dataset about 2 hours ago
GitBag/1743982255
updated a dataset about 2 hours ago
GitBag/1743993729
View all activity

Organizations

Cornell-AGI's profile picture

Articles 1

Article
6

RLHF 101: A Technical Dive into RLHF