Gemma 3 Collection All versions of Google's new multimodal models in 1B, 4B, 12B, and 27B sizes. In GGUF, dynamic 4-bit and 16-bit formats. • 29 items • Updated about 17 hours ago • 43
view article Article A Deepdive into Aya Vision: Advancing the Frontier of Multilingual Multimodality 22 days ago • 69
C4AI Aya Vision Collection Aya Vision is a state-of-the-art family of vision models that brings multimodal capabilities to 23 languages. • 5 items • Updated 22 days ago • 68
microsoft/Phi-4-multimodal-instruct Automatic Speech Recognition • Updated about 13 hours ago • 767k • 1.23k