ggml.ai

company

https://ggml.ai

ggml_org

ggml-org

Activity Feed Request to join this org

AI & ML interests

None defined yet.

Recent Activity

ngxson new activity 16 days ago

ggml-org/SmolLM3-3B-GGUF:Error loading chat template

ngxson updated a model 16 days ago

ggml-org/SmolLM3-3B-GGUF

ngxson published a model 16 days ago

ggml-org/SmolLM3-3B-GGUF

View all activity

Articles

Introduction to ggml

Aug 13, 2024

• 226

ggml-org 's collections 12

Gemma 3n

ggml-org/gemma-3n-E2B-it-GGUF

4B • Updated 28 days ago • 3.85k • 10
ggml-org/gemma-3n-E4B-it-GGUF

7B • Updated 28 days ago • 5k • 14

VAD

Voice Activity Detection (VAD) models for whisper.cpp.

ggml-org/whisper-vad

Updated May 13 • 5

Qwen 2 VL and Qwen 2.5 VL

ggml-org/Qwen2.5-VL-3B-Instruct-GGUF

3B • Updated Apr 30 • 5.05k • 4
ggml-org/Qwen2.5-VL-7B-Instruct-GGUF

8B • Updated Apr 30 • 2.27k • 6
ggml-org/Qwen2.5-VL-32B-Instruct-GGUF

33B • Updated May 15 • 581 • 2
ggml-org/Qwen2-VL-2B-Instruct-GGUF

2B • Updated Apr 30 • 1.11k • 2

SmolVLM GGUF

ggml-org/SmolVLM2-2.2B-Instruct-GGUF

2B • Updated Apr 30 • 4.15k • 17
ggml-org/SmolVLM2-500M-Video-Instruct-GGUF

0.4B • Updated Apr 30 • 2.17k • 11
ggml-org/SmolVLM2-256M-Video-Instruct-GGUF

0.2B • Updated Apr 30 • 704 • 6
ggml-org/SmolVLM-Instruct-GGUF

2B • Updated Apr 30 • 505 • 6

llama.cpp presets

Models that are used for presets in llama.cpp.

ggml-org/gte-small-Q8_0-GGUF

Sentence Similarity • 0.0B • Updated Feb 6 • 74 • 1
ggml-org/bge-small-en-v1.5-Q8_0-GGUF

Feature Extraction • 0.0B • Updated Feb 6 • 88 • 1
ggml-org/e5-small-v2-Q8_0-GGUF

Sentence Similarity • 0.0B • Updated Feb 6 • 54

llama.vim

ggml-org/Qwen2.5-Coder-0.5B-Q8_0-GGUF

Text Generation • 0.5B • Updated Jan 31 • 3.88k • 6
ggml-org/Qwen2.5-Coder-1.5B-Q8_0-GGUF

Text Generation • 2B • Updated Oct 28, 2024 • 4.92k • 9
ggml-org/Qwen2.5-Coder-3B-Q8_0-GGUF

Text Generation • 3B • Updated Nov 26, 2024 • 1.37k • 5
ggml-org/Qwen2.5-Coder-7B-Q8_0-GGUF

Text Generation • 8B • Updated Oct 28, 2024 • 1.87k • 5

Multimodal GGUFs

Vision and audio models compatible with llama-server and llama-mtmd-cli

Gemma 3

Collection

4 items • Updated May 14 • 17
ggml-org/Mistral-Small-3.1-24B-Instruct-2503-GGUF

Image-Text-to-Text • 24B • Updated May 1 • 234 • 4
InternVL 3 and InternVL 2.5

Collection

10 items • Updated May 14
Qwen 2 VL and Qwen 2.5 VL

Collection

4 items • Updated May 14

InternVL 3 and InternVL 2.5

ggml-org/InternVL3-1B-Instruct-GGUF

0.6B • Updated May 10 • 331 • 3
ggml-org/InternVL3-2B-Instruct-GGUF

2B • Updated May 10 • 497 • 5
ggml-org/InternVL3-8B-Instruct-GGUF

8B • Updated May 10 • 720 • 5
ggml-org/InternVL3-14B-Instruct-GGUF

15B • Updated May 10 • 484 • 3

Qwen 3

ggml-org/Qwen3-0.6B-GGUF

0.8B • Updated Apr 28 • 850 • 5
ggml-org/Qwen3-1.7B-GGUF

2B • Updated Apr 28 • 1.09k • 1
ggml-org/Qwen3-4B-GGUF

4B • Updated Apr 28 • 421 • 1
ggml-org/Qwen3-8B-GGUF

8B • Updated Apr 28 • 523 • 3

Gemma 3

ggml-org/gemma-3-1b-it-GGUF

1.0B • Updated Mar 12 • 11.1k • 15
ggml-org/gemma-3-4b-it-GGUF

Image-Text-to-Text • 4B • Updated May 21 • 21.4k • 33
ggml-org/gemma-3-12b-it-GGUF

Image-Text-to-Text • 12B • Updated May 21 • 4.35k • 25
ggml-org/gemma-3-27b-it-GGUF

Image-Text-to-Text • 27B • Updated May 21 • 2.36k • 21

GGUF LoRA adapters

Adapters extracted from fine tuned models, using mergekit-extract-lora

ggml-org/LoRA-Llama-3-Instruct-abliteration-8B-F16-GGUF

0.1B • Updated Nov 1, 2024 • 36
ggml-org/LoRA-Qwen2.5-1.5B-Instruct-abliterated-F16-GGUF

0.1B • Updated Jan 23 • 15 • 2
ggml-org/LoRA-Qwen2.5-3B-Instruct-abliterated-F16-GGUF

0.1B • Updated Jan 9 • 21 • 1
ggml-org/LoRA-Qwen2.5-7B-Instruct-abliterated-v3-F16-GGUF

0.1B • Updated Jan 8 • 34 • 3

Gemma 1.1 GGUFs

ggml-org/gemma-1.1-2b-it-Q8_0-GGUF

3B • Updated Apr 5, 2024 • 1.39k • 1
ggml-org/gemma-1.1-7b-it-Q8_0-GGUF

9B • Updated Apr 5, 2024 • 11
ggml-org/gemma-1.1-7b-it-Q4_K_M-GGUF

9B • Updated Apr 5, 2024 • 398 • 4

Gemma 3n

ggml-org/gemma-3n-E2B-it-GGUF

4B • Updated 28 days ago • 3.85k • 10
ggml-org/gemma-3n-E4B-it-GGUF

7B • Updated 28 days ago • 5k • 14

Multimodal GGUFs

Vision and audio models compatible with llama-server and llama-mtmd-cli

Gemma 3

Collection

4 items • Updated May 14 • 17
ggml-org/Mistral-Small-3.1-24B-Instruct-2503-GGUF

Image-Text-to-Text • 24B • Updated May 1 • 234 • 4
InternVL 3 and InternVL 2.5

Collection

10 items • Updated May 14
Qwen 2 VL and Qwen 2.5 VL

Collection

4 items • Updated May 14

VAD

Voice Activity Detection (VAD) models for whisper.cpp.

ggml-org/whisper-vad

Updated May 13 • 5

InternVL 3 and InternVL 2.5

ggml-org/InternVL3-1B-Instruct-GGUF

0.6B • Updated May 10 • 331 • 3
ggml-org/InternVL3-2B-Instruct-GGUF

2B • Updated May 10 • 497 • 5
ggml-org/InternVL3-8B-Instruct-GGUF

8B • Updated May 10 • 720 • 5
ggml-org/InternVL3-14B-Instruct-GGUF

15B • Updated May 10 • 484 • 3

Qwen 2 VL and Qwen 2.5 VL

ggml-org/Qwen2.5-VL-3B-Instruct-GGUF

3B • Updated Apr 30 • 5.05k • 4
ggml-org/Qwen2.5-VL-7B-Instruct-GGUF

8B • Updated Apr 30 • 2.27k • 6
ggml-org/Qwen2.5-VL-32B-Instruct-GGUF

33B • Updated May 15 • 581 • 2
ggml-org/Qwen2-VL-2B-Instruct-GGUF

2B • Updated Apr 30 • 1.11k • 2

Qwen 3

ggml-org/Qwen3-0.6B-GGUF

0.8B • Updated Apr 28 • 850 • 5
ggml-org/Qwen3-1.7B-GGUF

2B • Updated Apr 28 • 1.09k • 1
ggml-org/Qwen3-4B-GGUF

4B • Updated Apr 28 • 421 • 1
ggml-org/Qwen3-8B-GGUF

8B • Updated Apr 28 • 523 • 3

SmolVLM GGUF

ggml-org/SmolVLM2-2.2B-Instruct-GGUF

2B • Updated Apr 30 • 4.15k • 17
ggml-org/SmolVLM2-500M-Video-Instruct-GGUF

0.4B • Updated Apr 30 • 2.17k • 11
ggml-org/SmolVLM2-256M-Video-Instruct-GGUF

0.2B • Updated Apr 30 • 704 • 6
ggml-org/SmolVLM-Instruct-GGUF

2B • Updated Apr 30 • 505 • 6

Gemma 3

ggml-org/gemma-3-1b-it-GGUF

1.0B • Updated Mar 12 • 11.1k • 15
ggml-org/gemma-3-4b-it-GGUF

Image-Text-to-Text • 4B • Updated May 21 • 21.4k • 33
ggml-org/gemma-3-12b-it-GGUF

Image-Text-to-Text • 12B • Updated May 21 • 4.35k • 25
ggml-org/gemma-3-27b-it-GGUF

Image-Text-to-Text • 27B • Updated May 21 • 2.36k • 21

llama.cpp presets

Models that are used for presets in llama.cpp.

ggml-org/gte-small-Q8_0-GGUF

Sentence Similarity • 0.0B • Updated Feb 6 • 74 • 1
ggml-org/bge-small-en-v1.5-Q8_0-GGUF

Feature Extraction • 0.0B • Updated Feb 6 • 88 • 1
ggml-org/e5-small-v2-Q8_0-GGUF

Sentence Similarity • 0.0B • Updated Feb 6 • 54

GGUF LoRA adapters

Adapters extracted from fine tuned models, using mergekit-extract-lora

ggml-org/LoRA-Llama-3-Instruct-abliteration-8B-F16-GGUF

0.1B • Updated Nov 1, 2024 • 36
ggml-org/LoRA-Qwen2.5-1.5B-Instruct-abliterated-F16-GGUF

0.1B • Updated Jan 23 • 15 • 2
ggml-org/LoRA-Qwen2.5-3B-Instruct-abliterated-F16-GGUF

0.1B • Updated Jan 9 • 21 • 1
ggml-org/LoRA-Qwen2.5-7B-Instruct-abliterated-v3-F16-GGUF

0.1B • Updated Jan 8 • 34 • 3

llama.vim

ggml-org/Qwen2.5-Coder-0.5B-Q8_0-GGUF

Text Generation • 0.5B • Updated Jan 31 • 3.88k • 6
ggml-org/Qwen2.5-Coder-1.5B-Q8_0-GGUF

Text Generation • 2B • Updated Oct 28, 2024 • 4.92k • 9
ggml-org/Qwen2.5-Coder-3B-Q8_0-GGUF

Text Generation • 3B • Updated Nov 26, 2024 • 1.37k • 5
ggml-org/Qwen2.5-Coder-7B-Q8_0-GGUF

Text Generation • 8B • Updated Oct 28, 2024 • 1.87k • 5

Gemma 1.1 GGUFs

ggml-org/gemma-1.1-2b-it-Q8_0-GGUF

3B • Updated Apr 5, 2024 • 1.39k • 1
ggml-org/gemma-1.1-7b-it-Q8_0-GGUF

9B • Updated Apr 5, 2024 • 11
ggml-org/gemma-1.1-7b-it-Q4_K_M-GGUF

9B • Updated Apr 5, 2024 • 398 • 4

AI & ML interests

Recent Activity

Articles

Introduction to ggml

Team members 10

ggml-org 's collections 12