Multimodal GGUFs
Vision and audio models compatible with llama-server and llama-mtmd-cli

Gemma 3
Collection • 4 items • Updated 15 days ago • 15

ggml-org/Mistral-Small-3.1-24B-Instruct-2503-GGUF
Image-Text-to-Text • Updated 28 days ago • 1.05k • 4

InternVL 3 and InternVL 2.5
Collection • 10 items • Updated 15 days ago

Qwen 2 VL and Qwen 2.5 VL
Collection • 4 items • Updated 15 days ago
VAD
Voice Activity Detection (VAD) models for whisper.cpp.

ggml-org/whisper-vad
Updated 16 days ago • 1