Audio Conditioned LipSync with Latent Diffusion Models
Co-Speech Gesture Video Generation
Remove/Change background of video.
F5-TTS & E2-TTS: Zero-Shot Voice Cloning (Unofficial Demo)
AI filter for your portraits