arxiv:2412.15204
Yuxiao Dong
yuxiaod
AI & ML interests
None yet
Recent Activity
authored a paper 4 days ago
LongBench v2: Towards Deeper Understanding and Reasoning on Realistic Long-context Multitasks
authored a paper 7 days ago
SPaR: Self-Play with Tree-Search Refinement to Improve Instruction-Following in Large Language Models
authored a paper about 2 months ago
AndroidLab: Training and Systematic Benchmarking of Android Autonomous Agents
Organizations
Papers: 15
Models: None public yet
Datasets: None public yet