Jiaheng Liu

CheeryLJH

AI & ML interests

None yet

Recent Activity

upvoted a paper 6 days ago

Part I: Tricks or Traps? A Deep Dive into RL for LLM Reasoning

commented on a paper 6 days ago

Part I: Tricks or Traps? A Deep Dive into RL for LLM Reasoning

upvoted a paper 6 days ago

WideSearch: Benchmarking Agentic Broad Info-Seeking

upvoted 2 papers 6 days ago

Part I: Tricks or Traps? A Deep Dive into RL for LLM Reasoning

Paper • 2508.08221 • Published 6 days ago • 36

WideSearch: Benchmarking Agentic Broad Info-Seeking

Paper • 2508.07999 • Published 7 days ago • 99

upvoted a paper 11 days ago

VeriGUI: Verifiable Long-Chain GUI Dataset

Paper • 2508.04026 • Published 12 days ago • 137

upvoted a paper 12 days ago

Seed Diffusion: A Large-Scale Diffusion Language Model with High-Speed Inference

Paper • 2508.02193 • Published 14 days ago • 123

upvoted 6 papers about 1 month ago

Traceable Evidence Enhanced Visual Grounded Reasoning: Evaluation and Methodology

Paper • 2507.07999 • Published Jul 10 • 47

A Systematic Analysis of Hybrid Linear Attention

Paper • 2507.06457 • Published Jul 8 • 22

A Survey on Latent Reasoning

Paper • 2507.06203 • Published Jul 8 • 88

CriticLean: Critic-Guided Reinforcement Learning for Mathematical Formalization

Paper • 2507.06181 • Published Jul 8 • 41

Agent KB: Leveraging Cross-Domain Experience for Agentic Problem Solving

Paper • 2507.06229 • Published Jul 8 • 72

ArtifactsBench: Bridging the Visual-Interactive Gap in LLM Code Generation Evaluation

Paper • 2507.04952 • Published Jul 7 • 9

upvoted 3 papers about 2 months ago

MMSearch-R1: Incentivizing LMMs to Search

Paper • 2506.20670 • Published Jun 25 • 62

OAgents: An Empirical Study of Building Effective Agents

Paper • 2506.15741 • Published Jun 17 • 35

REASONING GYM: Reasoning Environments for Reinforcement Learning with Verifiable Rewards

Paper • 2505.24760 • Published May 30 • 69

upvoted 7 papers 2 months ago

Scaling Test-time Compute for LLM Agents

Paper • 2506.12928 • Published Jun 15 • 61

TaskCraft: Automated Generation of Agentic Tasks

Paper • 2506.10055 • Published Jun 11 • 32

Scientists' First Exam: Probing Cognitive Abilities of MLLM via Perception, Understanding, and Reasoning

Paper • 2506.10521 • Published Jun 12 • 74

MiniMax-M1: Scaling Test-Time Compute Efficiently with Lightning Attention

Paper • 2506.13585 • Published Jun 16 • 261

Lingshu: A Generalist Foundation Model for Unified Multimodal Medical Understanding and Reasoning

Paper • 2506.07044 • Published Jun 8 • 112

Seedance 1.0: Exploring the Boundaries of Video Generation Models

Paper • 2506.09113 • Published Jun 10 • 101

SWE-Flow: Synthesizing Software Engineering Data in a Test-Driven Manner

Paper • 2506.09003 • Published Jun 10 • 19