Generate images using prompts and LoRA models
Create personalized speech using text and audio samples
Generate a 3D mesh model from an image
In-browser speech recognition w/ word-level timestamps
Apply the motion of a video on a portrait