microsoft/Phi-4-multimodal-instruct • Automatic Speech Recognition • Updated about 14 hours ago • 441k • 1.12k
Ultravox v0.5 Collection • Ultravox is a multimodal speech LLM that pairs a frozen pretrained LLM (several are offered) with a fine-tuned whisper-large-v3-turbo backbone. • 3 items • Updated about 1 month ago • 8