Submitted by chongjie 67 Light of Normals: Unified Feature Representation for Universal Photometric Stereo · 14 authors 85 2
Submitted by mozhu 34 LongWriter-Zero: Mastering Ultra-Long Text Generation via Reinforcement Learning · 5 authors 1
Submitted by ZhuoweiChen 24 Phantom-Data : Towards a General Subject-Consistent Video Generation Dataset · 11 authors 25 2
Submitted by michaal94 24 ViDAR: Video Diffusion-Aware 4D Reconstruction From Monocular Inputs · 6 authors 1
Submitted by Lingaaaaaaa 22 ReasonFlux-PRM: Trajectory-Aware PRMs for Long Chain-of-Thought Reasoning in LLMs · 7 authors 413 1
Submitted by Yirany 22 RLPR: Extrapolating RLVR to General Domains without Verifiers · 12 authors 26 3
Submitted by csuhan 21 Vision as a Dialect: Unifying Visual Understanding and Generation via Text-Aligned Representations · 9 authors 41 1
Submitted by liguang0115 11 VMem: Consistent Interactive Video Scene Generation with Surfel-Indexed View Memory · 4 authors 43 1
Submitted by sgidaris 9 DIP: Unsupervised Dense In-Context Post-training of Visual Representations · 5 authors 1
Submitted by vyokky 8 LettinGo: Explore User Profile Generation for Recommendation System · 12 authors 1
Submitted by LogicTrainer 6 TC-Light: Temporally Consistent Relighting for Dynamic Long Videos · 9 authors 18 1
Submitted by cliang1453 6 SlimMoE: Structured Compression of Large MoE Models via Expert Slimming and Distillation · 7 authors 1
Submitted by manglu3935 6 Enhancing Step-by-Step and Verifiable Medical Reasoning in MLLMs · 9 authors 5 2
Submitted by natnitaract 6 FinCoT: Grounding Chain-of-Thought in Expert Financial Reasoning · 6 authors 8 1
Submitted by ashmrz 6 4Real-Video-V2: Fused View-Time Attention and Feedforward Reconstruction for 4D Scene Generation · 12 authors 1
Submitted by kittttttt 5 ReDit: Reward Dithering for Improved LLM Policy Optimization · 6 authors 1 1
Submitted by kamahori 5 ConsumerBench: Benchmarking Generative AI Applications on End-User Devices · 6 authors 3 1
Submitted by Neo111x 4 I Know Which LLM Wrote Your Code Last Summer: LLM generated Code Stylometry for Authorship Attribution · 9 authors 0 1
Submitted by vanshs1 3 Steering Conceptual Bias via Transformer Latent-Subspace Activation · 2 authors 1
Submitted by seonglae 3 FaithfulSAE: Towards Capturing Faithful Features with Sparse Autoencoders without External Dataset Dependencies · 6 authors 1
Submitted by BoKelvin 2 GEMeX-ThinkVG: Towards Thinking with Visual Grounding in Medical VQA via Reinforcement Learning · 6 authors 1
Submitted by akanatas 2 CultureMERT: Continual Pre-Training for Cross-Cultural Music Representation Learning · 3 authors 1
Submitted by rajandasgupta 2 A deep learning and machine learning approach to predict neonatal death in the context of São Paulo · 9 authors 1
Submitted by xunguangwang 2 SoK: Evaluating Jailbreak Guardrails for Large Language Models · 6 authors 1
Submitted by tahirakazimi77 1 Audit & Repair: An Agentic Framework for Consistent Story Visualization in Text-to-Image Diffusion Models · 3 authors 1