AccVideo: Accelerating Video Diffusion Model with Synthetic Dataset Paper ā¢ 2503.19462 ā¢ Published 3 days ago ā¢ 5
Vchitect-2.0: Parallel Transformer for Scaling Up Video Diffusion Models Paper ā¢ 2501.08453 ā¢ Published Jan 14 ā¢ 1
CFG-Zero*: Improved Classifier-Free Guidance for Flow Matching Models Paper ā¢ 2503.18886 ā¢ Published 3 days ago ā¢ 16
Decouple-Then-Merge: Finetune Diffusion Models as Multi-Task Learning Paper ā¢ 2410.06664 ā¢ Published Oct 9, 2024 ā¢ 1
VBench++: Comprehensive and Versatile Benchmark Suite for Video Generative Models Paper ā¢ 2411.13503 ā¢ Published Nov 20, 2024 ā¢ 34
OmniCorpus: A Unified Multimodal Corpus of 10 Billion-Level Images Interleaved with Text Paper ā¢ 2406.08418 ā¢ Published Jun 12, 2024 ā¢ 29
VideoMAE V2: Scaling Video Masked Autoencoders with Dual Masking Paper ā¢ 2303.16727 ā¢ Published Mar 29, 2023
VBench++: Comprehensive and Versatile Benchmark Suite for Video Generative Models Paper ā¢ 2411.13503 ā¢ Published Nov 20, 2024 ā¢ 34
VideoChat-Flash: Hierarchical Compression for Long-Context Video Modeling Paper ā¢ 2501.00574 ā¢ Published Dec 31, 2024 ā¢ 6
InternVideo2.5: Empowering Video MLLMs with Long and Rich Context Modeling Paper ā¢ 2501.12386 ā¢ Published Jan 21 ā¢ 1
DiffVSR: Enhancing Real-World Video Super-Resolution with Diffusion Models for Advanced Visual Quality and Temporal Consistency Paper ā¢ 2501.10110 ā¢ Published Jan 17
Vchitect-2.0: Parallel Transformer for Scaling Up Video Diffusion Models Paper ā¢ 2501.08453 ā¢ Published Jan 14 ā¢ 1
V-STaR: Benchmarking Video-LLMs on Video Spatio-Temporal Reasoning Paper ā¢ 2503.11495 ā¢ Published 13 days ago ā¢ 11