SANA 1.5: Efficient Scaling of Training-Time and Inference-Time Compute in Linear Diffusion Transformer Paper • 2501.18427 • Published Jan 30 • 21
Sparse VideoGen: Accelerating Video Diffusion Transformers with Spatial-Temporal Sparsity Paper • 2502.01776 • Published Feb 3 • 3
Accelerate High-Quality Diffusion Models with Inner Loop Feedback Paper • 2501.13107 • Published Jan 22 • 2
SANA-Sprint: One-Step Diffusion with Continuous-Time Consistency Distillation Paper • 2503.09641 • Published Mar 12 • 40
DC-AR: Efficient Masked Autoregressive Image Generation with Deep Compression Hybrid Tokenizer Paper • 2507.04947 • Published Jul 7
DC-AE 1.5: Accelerating Diffusion Model Convergence with Structured Latent Space Paper • 2508.00413 • Published 17 days ago • 2
Sparse VideoGen2: Accelerate Video Generation with Sparse Attention via Semantic-Aware Permutation Paper • 2505.18875 • Published May 24 • 42
LongVILA: Scaling Long-Context Visual Language Models for Long Videos Paper • 2408.10188 • Published Aug 19, 2024 • 53
VILA-U: a Unified Foundation Model Integrating Visual Understanding and Generation Paper • 2409.04429 • Published Sep 6, 2024
SANA: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformers Paper • 2410.10629 • Published Oct 14, 2024 • 12
COAT: Compressing Optimizer states and Activation for Memory-Efficient FP8 Training Paper • 2410.19313 • Published Oct 25, 2024 • 19
TinyTL: Reduce Activations, Not Trainable Parameters for Efficient On-Device Learning Paper • 2007.11622 • Published Jul 22, 2020
COAT: Compressing Optimizer states and Activation for Memory-Efficient FP8 Training Paper • 2410.19313 • Published Oct 25, 2024 • 19
Long Text Generation via Adversarial Training with Leaked Information Paper • 1709.08624 • Published Sep 24, 2017
TinyTL: Reduce Activations, Not Trainable Parameters for Efficient On-Device Learning Paper • 2007.11622 • Published Jul 22, 2020
Real-Time Bidding by Reinforcement Learning in Display Advertising Paper • 1701.02490 • Published Jan 10, 2017