Gemma 3 12b It
Evaluate and generate text based on images and videos
(Unofficial) Gradio demo for Spark-TTS
Transcribe audio to text from URLs or uploads
Transcribe audio files into text
A text-to-speech model powered by SparkAudio and Mobvoi.
Audio Flamingo 2 Demo
Identify and segment objects in images using text prompts, visual prompts, or no prompt at all