99
Phi 4 Multimodal
🌖
Interact with AI using text, images, or audio
Talk to OpenAI (Gradio UI)
Say computer (Gradio)
Talk to Gemini using Google's multimodal API
Talk to Phonic AI's speech-to-speech model
LLM Voice by ElevenLabs (Gradio)
Transcribe audio in realtime - Gradio UI version
Llama 3.2 - SambaNova API