BALROG: Benchmarking Agentic LLM and VLM Reasoning On Games Paper • 2411.13543 • Published Nov 20, 2024 • 18
OminiControl: Minimal and Universal Control for Diffusion Transformer Paper • 2411.15098 • Published Nov 22, 2024 • 53
VBench++: Comprehensive and Versatile Benchmark Suite for Video Generative Models Paper • 2411.13503 • Published Nov 20, 2024 • 30
AC3D: Analyzing and Improving 3D Camera Control in Video Diffusion Transformers Paper • 2411.18673 • Published Nov 27, 2024 • 8
Training Language Models to Self-Correct via Reinforcement Learning Paper • 2409.12917 • Published Sep 19, 2024 • 136
CustomCrafter: Customized Video Generation with Preserving Motion and Concept Composition Abilities Paper • 2408.13239 • Published Aug 23, 2024 • 11
TrackGo: A Flexible and Efficient Method for Controllable Video Generation Paper • 2408.11475 • Published Aug 21, 2024 • 17
InFusion: Inpainting 3D Gaussians via Learning Depth Completion from Diffusion Prior Paper • 2404.11613 • Published Apr 17, 2024 • 11