Long-Video Audio Synthesis with Multi-Agent Collaboration Paper • 2503.10719 • Published 12 days ago • 9
Open ASR Leaderboard 🏆 Request and view assessments for speech recognition models • Running on CPU Upgrade • 691
TheoremExplainAgent: Towards Multimodal Explanations for LLM Theorem Understanding Paper • 2502.19400 • Published 27 days ago • 45
ART: Anonymous Region Transformer for Variable Multi-Layer Transparent Image Generation Paper • 2502.18364 • Published 28 days ago • 34
MutaGReP: Execution-Free Repository-Grounded Plan Search for Code-Use Paper • 2502.15872 • Published Feb 21 • 5