Running on Zero | 536 | MMAudio: generating synchronized audio from video/text | Generate audio from video or text prompts
Running | 543 | Talking Face Generation with Multilingual TTS | Generate a talking face video from text