Transcribe audio from microphone, file, or YouTube link
Generate audio from text using a voice synthesis model