3D Generation from text prompts
F5-TTS & E2-TTS: Zero-Shot Voice Cloning (Unofficial Demo)
Generate realistic talking heads from image+audio