Submitted by RichardQRQ 118 We-Math 2.0: A Versatile MathBook System for Incentivizing Visual Mathematical Reasoning · 14 authors 111 6
Submitted by yuangpeng 101 NextStep-1: Toward Autoregressive Image Generation with Continuous Tokens at Scale · 50 authors 278 5
Submitted by l-li 33 ToonComposer: Streamlining Cartoon Production with Generative Post-Keyframing · 9 authors 55 2
Submitted by ttchungc 33 PRELUDE: A Benchmark Designed to Require Global Comprehension and Reasoning over Long Contexts · 11 authors 2
Submitted by zhangxgu 18 UI-Venus Technical Report: Building High-performance UI Agents with RFT · 24 authors 0 2
Submitted by yslan 14 STream3R: Scalable Sequential 3D Reconstruction with Causal Transformer · 10 authors 45 3
Submitted by TimothyCzp 9 Pass@k Training for Adaptively Balancing Exploration and Exploitation of Large Reasoning Models · 7 authors 8 2
Submitted by RobinRoaR 7 HumanSense: From Multimodal Perception to Empathetic Context-Aware Responses through Reasoning MLLMs · 7 authors 2
Submitted by stojnvla 3 Processing and acquisition traces in visual encoders: What does CLIP know about your camera? · 6 authors 0 2
Submitted by Geralt-Targaryen 2 From Black Box to Transparency: Enhancing Automated Interpreting Assessment with Explainable AI in College Classrooms · 2 authors 2
Submitted by mdhaini - When Explainability Meets Privacy: An Investigation at the Intersection of Post-hoc Explainability and Differential Privacy in the Context of Natural Language Processing · 5 authors 1 2