43 39

Zeyu Qin

qqqzzzyyy

https://alan-qin.github.io/

Alan-Qin

AI & ML interests

Scalable Oversight, AI safety

Organizations

None yet

Recent Activity

upvoted a collection about 21 hours ago

agent

171 items • Updated 1 day ago • 10

upvoted a paper about 21 hours ago

AWorld: Dynamic Multi-Agent System with Stable Maneuvering for Robust GAIA Problem Solving

Paper • 2508.09889 • Published 6 days ago • 29

upvoted a paper 6 days ago

Beyond Ten Turns: Unlocking Long-Horizon Agentic Search with Large-Scale Asynchronous RL

Paper • 2508.07976 • Published 8 days ago • 44

upvoted a collection 8 days ago

DataMan

4 items • Updated 11 days ago • 2

upvoted a paper 21 days ago

A Survey of Self-Evolving Agents: On Path to Artificial Super Intelligence

Paper • 2507.21046 • Published 21 days ago • 79

upvoted a paper about 1 month ago

SWE-Perf: Can Language Models Optimize Code Performance on Real-World Repositories?

Paper • 2507.12415 • Published Jul 16 • 41

upvoted a paper about 2 months ago

WebSailor: Navigating Super-human Reasoning for Web Agent

Paper • 2507.02592 • Published Jul 3 • 110

upvoted a collection about 2 months ago

hahah

1 item • Updated Jun 26 • 1

upvoted 2 articles 3 months ago

BigCodeBench: Benchmarking Large Language Models on Solving Practical and Challenging Programming Tasks

Article • By … and 8 others • Jun 18, 2024 • 52

Open R1: Update #3

Article • By … and 9 others • Mar 11 • 295

upvoted 2 collections 5 months ago

Cognitive Behaviors

4 items • Updated Mar 19 • 2

DeepSeek-R1

10 items • Updated May 29 • 780

upvoted a collection 6 months ago

NuminaMath

Datasets and models for training SOTA math LLMs. See our GitHub for training & inference code: https://github.com/project-numina/aimo-progress-prize • 7 items • Updated Feb 10 • 78

upvoted 4 papers 6 months ago

SuperGPQA: Scaling LLM Evaluation across 285 Graduate Disciplines

Paper • 2502.14739 • Published Feb 20 • 105

Revisiting the Test-Time Scaling of o1-like Models: Do they Truly Possess Test-Time Scaling Capabilities?

Paper • 2502.12215 • Published Feb 17 • 16

CodeI/O: Condensing Reasoning Patterns via Code Input-Output Prediction

Paper • 2502.07316 • Published Feb 11 • 50

s1: Simple test-time scaling

Paper • 2501.19393 • Published Jan 31 • 126

upvoted a paper 7 months ago

Sigma: Differential Rescaling of Query, Key and Value for Efficient Language Models

Paper • 2501.13629 • Published Jan 23 • 49

upvoted a paper 8 months ago

ProcessBench: Identifying Process Errors in Mathematical Reasoning

Paper • 2412.06559 • Published Dec 9, 2024 • 85

upvoted a paper 9 months ago

AgentInstruct: Toward Generative Teaching with Agentic Flows

Paper • 2407.03502 • Published Jul 3, 2024 • 51