pix2gestalt: Amodal Segmentation by Synthesizing Wholes Paper • 2401.14398 • Published Jan 25, 2024 • 10
Diffuse to Choose: Enriching Image Conditioned Inpainting in Latent Diffusion Models for Virtual Try-All Paper • 2401.13795 • Published Jan 24, 2024 • 67
Incremental FastPitch: Chunk-based High Quality Text to Speech Paper • 2401.01755 • Published Jan 3, 2024 • 9
Self-Play Fine-Tuning Converts Weak Language Models to Strong Language Models Paper • 2401.01335 • Published Jan 2, 2024 • 64
PanGu-Draw: Advancing Resource-Efficient Text-to-Image Synthesis with Time-Decoupled Training and Reusable Coop-Diffusion Paper • 2312.16486 • Published Dec 27, 2023 • 7
A Recipe for Scaling up Text-to-Video Generation with Text-free Videos Paper • 2312.15770 • Published Dec 25, 2023 • 13
HarmonyView: Harmonizing Consistency and Diversity in One-Image-to-3D Paper • 2312.15980 • Published Dec 26, 2023 • 11
One-dimensional Adapter to Rule Them All: Concepts, Diffusion Models and Erasing Applications Paper • 2312.16145 • Published Dec 26, 2023 • 9
RichDreamer: A Generalizable Normal-Depth Diffusion Model for Detail Richness in Text-to-3D Paper • 2311.16918 • Published Nov 28, 2023 • 9
PICTURE: PhotorealistIC virtual Try-on from UnconstRained dEsigns Paper • 2312.04534 • Published Dec 7, 2023 • 6
General Object Foundation Model for Images and Videos at Scale Paper • 2312.09158 • Published Dec 14, 2023 • 9