VideoGen: A Reference-Guided Latent Diffusion Approach for High Definition Text-to-Video Generation Paper ⢠2309.00398 ⢠Published Sep 1, 2023 ⢠22 ⢠6
VideoGen: A Reference-Guided Latent Diffusion Approach for High Definition Text-to-Video Generation Paper ⢠2309.00398 ⢠Published Sep 1, 2023 ⢠22 ⢠6
Dual-Stream Diffusion Net for Text-to-Video Generation Paper ⢠2308.08316 ⢠Published Aug 16, 2023 ⢠24 ⢠3
Mega-TTS 2: Zero-Shot Text-to-Speech with Arbitrary Length Speech Prompts Paper ⢠2307.07218 ⢠Published Jul 14, 2023 ⢠27 ⢠10
DragDiffusion: Harnessing Diffusion Models for Interactive Point-based Image Editing Paper ⢠2306.14435 ⢠Published Jun 26, 2023 ⢠20 ⢠5
AssistGPT: A General Multi-modal Assistant that can Plan, Execute, Inspect, and Learn Paper ⢠2306.08640 ⢠Published Jun 14, 2023 ⢠26 ⢠2
Rerender A Video: Zero-Shot Text-Guided Video-to-Video Translation Paper ⢠2306.07954 ⢠Published Jun 13, 2023 ⢠111 ⢠11
Cinematic Mindscapes: High-quality Video Reconstruction from Brain Activity Paper ⢠2305.11675 ⢠Published May 19, 2023 ⢠1 ⢠1
Pengi: An Audio Language Model for Audio Tasks Paper ⢠2305.11834 ⢠Published May 19, 2023 ⢠2 ⢠1
OpenShape: Scaling Up 3D Shape Representation Towards Open-World Understanding Paper ⢠2305.10764 ⢠Published May 18, 2023 ⢠6 ⢠4