SANA 1.5: Efficient Scaling of Training-Time and Inference-Time Compute in Linear Diffusion Transformer Paper • 2501.18427 • Published Jan 30 • 21
Sparse VideoGen: Accelerating Video Diffusion Transformers with Spatial-Temporal Sparsity Paper • 2502.01776 • Published Feb 3 • 3
Accelerate High-Quality Diffusion Models with Inner Loop Feedback Paper • 2501.13107 • Published Jan 22 • 2
SANA-Sprint: One-Step Diffusion with Continuous-Time Consistency Distillation Paper • 2503.09641 • Published Mar 12 • 40
DC-AR: Efficient Masked Autoregressive Image Generation with Deep Compression Hybrid Tokenizer Paper • 2507.04947 • Published Jul 7
DC-AE 1.5: Accelerating Diffusion Model Convergence with Structured Latent Space Paper • 2508.00413 • Published 17 days ago • 2
Locality-aware Parallel Decoding for Efficient Autoregressive Image Generation Paper • 2507.01957 • Published Jul 2 • 20
SparseLoRA: Accelerating LLM Fine-Tuning with Contextual Sparsity Paper • 2506.16500 • Published Jun 19 • 17
Radial Attention: $O(n\log n)$ Sparse Attention with Energy Decay for Long Video Generation Paper • 2506.19852 • Published Jun 24 • 41
SparseViT: Revisiting Activation Sparsity for Efficient High-Resolution Vision Transformer Paper • 2303.17605 • Published Mar 30, 2023
HAT: Hardware-Aware Transformers for Efficient Natural Language Processing Paper • 2005.14187 • Published May 28, 2020 • 2
MapPrior: Bird's-Eye View Map Layout Estimation with Generative Models Paper • 2308.12963 • Published Aug 24, 2023