MohamedRashad/Voxtral-Small-24B-2507-transformers Audio-Text-to-Text • 24B • Updated about 1 month ago • 3.73k • 2
Fanar Collection A powerful and versatile family of Arabic Large Language Models (LLMs) designed for a wide range of tasks. • 3 items • Updated Jun 10 • 6
Running on T4 • 105 • CountGD_Multi-Modal_Open-World_Counting 🚀 Count objects in images using text, visual examples, or both
Article seemore: Implement a Vision Language Model from Scratch By AviSoori1x • Jun 23, 2024 • 94
Qwen2.5-VL Collection Vision-language model series based on Qwen2.5 • 11 items • Updated 27 days ago • 528