Wonderland: Navigating 3D Scenes from a Single Image Paper • 2412.12091 • Published 9 days ago • 14
SnapGen: Taming High-Resolution Text-to-Image Models for Mobile Devices with Efficient Architectures and Training Paper • 2412.09619 • Published 13 days ago • 20
BitsFusion: 1.99 bits Weight Quantization of Diffusion Model Paper • 2406.04333 • Published Jun 6 • 36
TextCraftor: Your Text Encoder Can be Image Quality Controller Paper • 2403.18978 • Published Mar 27 • 13
Panda-70M: Captioning 70M Videos with Multiple Cross-Modality Teachers Paper • 2402.19479 • Published Feb 29 • 32
Snap Video: Scaled Spatiotemporal Transformers for Text-to-Video Synthesis Paper • 2402.14797 • Published Feb 22 • 19
LightSpeed: Light and Fast Neural Light Fields on Mobile Devices Paper • 2310.16832 • Published Oct 25, 2023 • 4
HyperHuman: Hyper-Realistic Human Generation with Latent Structural Diffusion Paper • 2310.08579 • Published Oct 12, 2023 • 15
R2L: Distilling Neural Radiance Field to Neural Light Field for Efficient Novel View Synthesis Paper • 2203.17261 • Published Mar 31, 2022 • 1
Rethinking Vision Transformers for MobileNet Size and Speed Paper • 2212.08059 • Published Dec 15, 2022 • 4
Magic123: One Image to High-Quality 3D Object Generation Using Both 2D and 3D Diffusion Priors Paper • 2306.17843 • Published Jun 30, 2023 • 43
SnapFusion: Text-to-Image Diffusion Model on Mobile Devices within Two Seconds Paper • 2306.00980 • Published Jun 1, 2023 • 15