Transcribe audio from microphone, file, or YouTube link
Generate images from text descriptions
Analyze images to detect human poses