Generate and convert speech using text and audio inputs
Generate audio from text using a voice synthesis model
Remove background from images