NextStep-1: Toward Autoregressive Image Generation with Continuous Tokens at Scale Paper • 2508.10711 • Published 4 days ago • 127
Mol-R1: Towards Explicit Long-CoT Reasoning in Molecule Discovery Paper • 2508.08401 • Published 7 days ago • 37
WebWatcher: Breaking New Frontier of Vision-Language Deep Research Agent Paper • 2508.05748 • Published 11 days ago • 112
Omni-Effects: Unified and Spatially-Controllable Visual Effects Generation Paper • 2508.07981 • Published 7 days ago • 56
GLM-4.5: Agentic, Reasoning, and Coding (ARC) Foundation Models Paper • 2508.06471 • Published 10 days ago • 143
DeepPHY: Benchmarking Agentic VLMs on Physical Reasoning Paper • 2508.05405 • Published 11 days ago • 61
SEAgent: Self-Evolving Computer Use Agent with Autonomous Learning from Experience Paper • 2508.04700 • Published 12 days ago • 46
Efficient Agents: Building Effective Agents While Reducing Cost Paper • 2508.02694 • Published 25 days ago • 79
Seed-Prover: Deep and Broad Reasoning for Automated Theorem Proving Paper • 2507.23726 • Published 18 days ago • 106
Phi-Ground Tech Report: Advancing Perception in GUI Grounding Paper • 2507.23779 • Published 18 days ago • 41
ScreenCoder: Advancing Visual-to-Code Generation for Front-End Automation via Modular Multimodal Agents Paper • 2507.22827 • Published 19 days ago • 91
HunyuanWorld 1.0: Generating Immersive, Explorable, and Interactive 3D Worlds from Words or Pixels Paper • 2507.21809 • Published 20 days ago • 123
MUR: Momentum Uncertainty guided Reasoning for Large Language Models Paper • 2507.14958 • Published 29 days ago • 46
DesignLab: Designing Slides Through Iterative Detection and Correction Paper • 2507.17202 • Published 26 days ago • 50