Jian Hu's picture

Jian Hu

chuyi777

·

https://hujian.website

hijkzzz

AI & ML interests

Reinforcement Learning

Recent Activity

upvoted a paper 8 days ago

REINFORCE++: A Simple and Efficient Approach for Aligning Large Language Models

commented on a paper 8 days ago

REINFORCE++: A Simple and Efficient Approach for Aligning Large Language Models

liked a dataset 12 days ago

AI-MO/NuminaMath-CoT

View all activity

Organizations

chuyi777's activity

upvoted a paper 8 days ago

REINFORCE++: A Simple and Efficient Approach for Aligning Large Language Models

Paper • 2501.03262 • Published 12 days ago • 79

upvoted a paper about 1 month ago

ProcessBench: Identifying Process Errors in Mathematical Reasoning

Paper • 2412.06559 • Published Dec 9, 2024 • 77

upvoted an article 7 months ago

Article

4D masks support in Transformers

By

•

Jan 8, 2024

• 15

upvoted a paper 8 months ago

OpenRLHF: An Easy-to-use, Scalable and High-performance RLHF Framework

Paper • 2405.11143 • Published May 20, 2024 • 36