Vocal and background audio separator
Spanish finetune for the original F5 model.
Create a 3D model from an image in 10 seconds!
Extract Japanese text from images
OmniParser, turn your LLM into GUI agent
Swap faces in images