Submitted by chujiezheng 80 ProcessBench: Identifying Process Errors in Mathematical Reasoning · 9 authors 6
Submitted by Shibo-UCSD 79 Training Large Language Models to Reason in a Continuous Latent Space · 7 authors 7
Submitted by avanturist 71 Unraveling the Complexity of Memory in RL Agents: an Approach for Classification and Evaluation · 5 authors 2
Submitted by nicolas-dufour 21 Around the World in 80 Timesteps: A Generative Approach to Global Visual Geolocation · 4 authors 2
Submitted by LooperXX 16 Exploring Multi-Grained Concept Annotations for Multimodal Large Language Models · 6 authors 2
Submitted by tttoaster 16 Divot: Diffusion Powers Video Tokenizer for Comprehension and Generation · 4 authors 2
Submitted by xinlongwang 13 You See it, You Got it: Learning 3D Creation on Pose-Free Videos at Scale · 7 authors 3
Submitted by Hidir 9 MotionShop: Zero-Shot Motion Transfer in Video Diffusion Models with Mixture of Score Guidance · 4 authors 2
Submitted by mikonvergence 8 Global and Dense Embeddings of Earth: Major TOM Floating in the Latent Space · 3 authors 2
Submitted by huangsiteng 7 CARP: Visuomotor Policy Learning via Coarse-to-Fine Autoregressive Prediction · 8 authors 2
Submitted by AntoineGuedon 7 MAtCha Gaussians: Atlas of Charts for High-Quality Geometry and Photorealism From Sparse Views · 4 authors 2
Submitted by mkhalifa 5 If You Can't Use Them, Recycle Them: Optimizing Merging at Scale Mitigates Performance Tradeoffs · 9 authors 2