Upgraded to v1.0!
Convert voices in audio using models
Generate captions for images in various styles
Transcribe Japanese audio to text