Rui Yang's picture

9 7 18

Rui Yang

Ray2333

·

https://yangrui2015.github.io

YangRui2015

AI & ML interests

Deep Reinforcement Learning

Recent Activity

liked a model 2 months ago

Ray2333/GRM_Llama3.1_8B_rewardmodel-ft

updated a model 2 months ago

Ray2333/GRM-Llama3-8B-rewardmodel-ft

updated a model 2 months ago

Ray2333/GRM-gemma2-2B-rewardmodel-ft

View all activity

Organizations

Ray2333's activity

upvoted 2 collections 2 months ago

Papers - Math - Reasoning

11 items • Updated Nov 10, 2024 • 1

Papers - Benchmarks - Math

4 items • Updated Nov 5, 2024 • 1

upvoted a paper 3 months ago

DynaMath: A Dynamic Visual Benchmark for Evaluating Mathematical Reasoning Robustness of Vision Language Models

Paper • 2411.00836 • Published Oct 29, 2024 • 15

upvoted a collection 7 months ago

GRM

Generalizable Reward Models • 11 items • Updated Nov 25, 2024 • 4

upvoted a paper 8 months ago

Regularizing Hidden States Enables Learning Generalizable Reward Model for LLMs

Paper • 2406.10216 • Published Jun 14, 2024 • 2

upvoted 2 papers 9 months ago

Towards Robust Offline Reinforcement Learning under Diverse Data Corruption

Paper • 2310.12955 • Published Oct 19, 2023 • 1

Rewards-in-Context: Multi-objective Alignment of Foundation Models with Dynamic Preference Adjustment

Paper • 2402.10207 • Published Feb 15, 2024 • 2