Temporal Consistency for LLM Reasoning Process Error Identification Paper • 2503.14495 • Published 7 days ago • 9
CapArena: Benchmarking and Analyzing Detailed Image Captioning in the LLM Era Paper • 2503.12329 • Published 10 days ago • 24
DAPO: An Open-Source LLM Reinforcement Learning System at Scale Paper • 2503.14476 • Published 7 days ago • 104
MathFusion: Enhancing Mathematic Problem-solving of LLM through Instruction Fusion Paper • 2503.16212 • Published 5 days ago • 21
LHM: Large Animatable Human Reconstruction Model from a Single Image in Seconds Paper • 2503.10625 • Published 12 days ago • 22
One-Step Residual Shifting Diffusion for Image Super-Resolution via Distillation Paper • 2503.13358 • Published 8 days ago • 88
Lost in Cultural Translation: Do LLMs Struggle with Math Across Cultural Contexts? Paper • 2503.18018 • Published 2 days ago • 4
Typed-RAG: Type-aware Multi-Aspect Decomposition for Non-Factoid Question Answering Paper • 2503.15879 • Published 6 days ago • 6
Video SimpleQA: Towards Factuality Evaluation in Large Video Language Models Paper • 2503.18923 • Published 1 day ago • 10
I Have Covered All the Bases Here: Interpreting Reasoning Features in Large Language Models via Sparse Autoencoders Paper • 2503.18878 • Published 1 day ago • 79
SPIN-Bench: How Well Do LLMs Plan Strategically and Reason Socially? Paper • 2503.12349 • Published 10 days ago • 38