Yulei Qin

yolay

https://yuleichin.github.io/

AI & ML interests

Medical Imaging, Computer Vision, Language Models

Organizations

None yet

upvoted 2 papers 3 days ago

Complex Logical Instruction Generation

Paper • 2508.09125 • Published 5 days ago • 38

OpenCUA: Open Foundations for Computer-Use Agents

Paper • 2508.09123 • Published 5 days ago • 21

upvoted a paper 11 days ago

Beyond the Trade-off: Self-Supervised Reinforcement Learning for Reasoning Models' Instruction Following

Paper • 2508.02150 • Published 13 days ago • 34

upvoted a paper 15 days ago

HunyuanWorld 1.0: Generating Immersive, Explorable, and Interactive 3D Worlds from Words or Pixels

Paper • 2507.21809 • Published 19 days ago • 122

upvoted 2 papers about 1 month ago

Reasoning or Memorization? Unreliable Results of Reinforcement Learning Due to Data Contamination

Paper • 2507.10532 • Published Jul 14 • 85

WebSailor: Navigating Super-human Reasoning for Web Agent

Paper • 2507.02592 • Published Jul 3 • 110

upvoted a collection about 2 months ago

RAIF

Collection

Datasets and models in the paper "Incentivizing Reasoning for Advanced Instruction-Following of Large Language Models" [github.com/yuleiqin/RAIF]. • 12 items • Updated Jul 17 • 1

upvoted a paper 2 months ago

WebChoreArena: Evaluating Web Browsing Agents on Realistic Tedious Web Tasks

Paper • 2506.01952 • Published Jun 2 • 10

upvoted a paper 3 months ago

Incentivizing Reasoning for Advanced Instruction-Following of Large Language Models

Paper • 2506.01413 • Published Jun 2 • 15

upvoted an article 6 months ago

Open-R1: Update #1

Article • and 7 others • Feb 2 • 305

upvoted a collection 6 months ago

🧠 Reasoning datasets

Collection

Datasets with reasoning traces for math and code released by the community • 24 items • Updated May 19 • 163

upvoted an article 6 months ago

Open-R1: a fully open reproduction of DeepSeek-R1

Article • and 2 others • Jan 28 • 877

upvoted 2 papers 10 months ago

GPT-4o System Card

Paper • 2410.21276 • Published Oct 25, 2024 • 85

Rethinking Data Selection at Scale: Random Selection is Almost All You Need

Paper • 2410.09335 • Published Oct 12, 2024 • 17

upvoted a paper 12 months ago

Leveraging Open Knowledge for Advancing Task Expertise in Large Language Models

Paper • 2408.15915 • Published Aug 28, 2024 • 20

upvoted a paper about 1 year ago

Unleashing the Power of Data Tsunami: A Comprehensive Survey on Data Assessment and Selection for Instruction Tuning of Language Models

Paper • 2408.02085 • Published Aug 4, 2024 • 19