Wonderland: Navigating 3D Scenes from a Single Image Paper • 2412.12091 • Published 9 days ago • 14
SnapGen: Taming High-Resolution Text-to-Image Models for Mobile Devices with Efficient Architectures and Training Paper • 2412.09619 • Published 13 days ago • 20
SnapGen: Taming High-Resolution Text-to-Image Models for Mobile Devices with Efficient Architectures and Training Paper • 2412.09619 • Published 13 days ago • 20 • 3
BitsFusion: 1.99 bits Weight Quantization of Diffusion Model Paper • 2406.04333 • Published Jun 6 • 36
SINE: SINgle Image Editing with Text-to-Image Diffusion Models Paper • 2212.04489 • Published Dec 8, 2022
BitsFusion: 1.99 bits Weight Quantization of Diffusion Model Paper • 2406.04333 • Published Jun 6 • 36
TextCraftor: Your Text Encoder Can be Image Quality Controller Paper • 2403.18978 • Published Mar 27 • 13
TextCraftor: Your Text Encoder Can be Image Quality Controller Paper • 2403.18978 • Published Mar 27 • 13
EfficientFormer: Vision Transformers at MobileNet Speed Paper • 2206.01191 • Published Jun 2, 2022 • 1
COMCAT: Towards Efficient Compression and Customization of Attention-Based Vision Models Paper • 2305.17235 • Published May 26, 2023 • 2
Discrete Contrastive Diffusion for Cross-Modal Music and Image Generation Paper • 2206.07771 • Published Jun 15, 2022
iNVS: Repurposing Diffusion Inpainters for Novel View Synthesis Paper • 2310.16167 • Published Oct 24, 2023 • 1
Panda-70M: Captioning 70M Videos with Multiple Cross-Modality Teachers Paper • 2402.19479 • Published Feb 29 • 32
Panda-70M: Captioning 70M Videos with Multiple Cross-Modality Teachers Paper • 2402.19479 • Published Feb 29 • 32