Transcribe audio from microphone, file, or YouTube link
Generate audio from text using a voice synthesis model
Generate audio from text using voice synthesis
Generate speech from text using a reference audio sample