STAR: Spatial-Temporal Augmentation with Text-to-Video Models for Real-World Video Super-Resolution Paper • 2501.02976 • Published 6 days ago • 46
SeedVR: Seeding Infinity in Diffusion Transformer Towards Generic Video Restoration Paper • 2501.01320 • Published 10 days ago • 10
VideoAnydoor: High-fidelity Video Object Insertion with Precise Motion Control Paper • 2501.01427 • Published 10 days ago • 46
VideoMaker: Zero-shot Customized Video Generation with the Inherent Force of Video Diffusion Models Paper • 2412.19645 • Published 16 days ago • 13
VidTwin: Video VAE with Decoupled Structure and Dynamics Paper • 2412.17726 • Published 20 days ago • 8
DiTCtrl: Exploring Attention Control in Multi-Modal Diffusion Transformer for Tuning-Free Multi-Prompt Longer Video Generation Paper • 2412.18597 • Published 19 days ago • 19
Fourier Position Embedding: Enhancing Attention's Periodic Extension for Length Generalization Paper • 2412.17739 • Published 20 days ago • 39
Large Motion Video Autoencoding with Cross-modal Video VAE Paper • 2412.17805 • Published 20 days ago • 24
CLEAR: Conv-Like Linearization Revs Pre-Trained Diffusion Transformers Up Paper • 2412.16112 • Published 23 days ago • 21
Taming Multimodal Joint Training for High-Quality Video-to-Audio Synthesis Paper • 2412.15322 • Published 24 days ago • 18
AV-Link: Temporally-Aligned Diffusion Features for Cross-Modal Audio-Video Generation Paper • 2412.15191 • Published 24 days ago • 5
Autoregressive Video Generation without Vector Quantization Paper • 2412.14169 • Published 25 days ago • 14
VividFace: A Diffusion-Based Hybrid Framework for High-Fidelity Video Face Swapping Paper • 2412.11279 • Published 28 days ago • 12
Causal Diffusion Transformers for Generative Modeling Paper • 2412.12095 • Published 27 days ago • 23
Byte Latent Transformer: Patches Scale Better Than Tokens Paper • 2412.09871 • Published about 1 month ago • 85
Arbitrary-steps Image Super-resolution via Diffusion Inversion Paper • 2412.09013 • Published Dec 12, 2024 • 11
FlowEdit: Inversion-Free Text-Based Editing Using Pre-Trained Flow Models Paper • 2412.08629 • Published Dec 11, 2024 • 11