Generate depth maps from images
Find objects in images using text prompts
Generate anime character voice from text