Edit Models filters

Apps

Docker Model Runner

Inference Providers

HF Inference API

Misc

Inference Endpoints

text-generation-inference

4-bit precision

8-bit precision

text-embeddings-inference

Carbon Emissions

Mixture of Experts

Models

21,947

Full-text search

Active filters: llama-cpp

fernandoruiz/OpenELM-3B-Instruct-Q4_K_M-GGUF

3B • Updated Jul 22, 2024 • 1

fernandoruiz/OpenELM-3B-Instruct-Q4_K_S-GGUF

3B • Updated Jul 22, 2024 • 2

fernandoruiz/Mistral-Nemo-Instruct-2407-Q4_K_M-GGUF

12B • Updated Jul 22, 2024 • 11

fernandoruiz/Mistral-Nemo-Instruct-2407-Q4_K_S-GGUF

12B • Updated Jul 22, 2024 • 5

victorbur/deepseek-coder-6.7B-kexer-Q8_0-GGUF

7B • Updated Jul 22, 2024 • 7

Jianping746/DeepSeek-V2-Lite-Chat-Q6_K-GGUF

16B • Updated Jul 22, 2024 • 24

Jianping746/DeepSeek-V2-Lite-Chat-Q4_K_M-GGUF

16B • Updated Jul 22, 2024 • 62

NikolayKozloff/Mistral-Nemo-Instruct-2407-Q8_0-GGUF

12B • Updated Jul 22, 2024 • 7 • 1

NikolayKozloff/Mistral-Nemo-Instruct-2407-Q6_K-GGUF

12B • Updated Jul 22, 2024 • 6 • 1

NikolayKozloff/Mistral-Nemo-Instruct-2407-Q5_K_M-GGUF

12B • Updated Jul 22, 2024 • 6 • 1

NikolayKozloff/Mistral-Nemo-Instruct-2407-Q5_K_S-GGUF

12B • Updated Jul 22, 2024 • 4 • 1

NikolayKozloff/SauerkrautLM-Nemo-12b-Instruct-Q8_0-GGUF

12B • Updated Jul 22, 2024 • 7 • 1

victorbur/SpyazWeb_AI_DeepMind_Project-Q8_0-GGUF

7B • Updated Jul 22, 2024 • 9

NikolayKozloff/SauerkrautLM-Nemo-12b-Instruct-Q6_K-GGUF

12B • Updated Jul 22, 2024 • 5 • 1

NikolayKozloff/SauerkrautLM-Nemo-12b-Instruct-Q5_K_M-GGUF

12B • Updated Jul 22, 2024 • 6 • 1

NikolayKozloff/SauerkrautLM-Nemo-12b-Instruct-Q5_K_S-GGUF

12B • Updated Jul 22, 2024 • 4 • 1

bunnycore/L3-Intermix-v0.2-Q4_K_M-GGUF

8B • Updated Jul 22, 2024 • 4 • 1

martintomov/Meta-Llama-3-8B-Alternate-Tokenizer-Q4_K_S-GGUF

Text Generation • 8B • Updated Jul 22, 2024 • 2

inflatebot/helide-alpha-r4-Q8_0-GGUF

8B • Updated Jul 22, 2024 • 1

alanrios2001/deepseek-coder-7b-instruct-v1.5-Q6_K-GGUF

7B • Updated Jul 22, 2024 • 4

spachava/DeepSeek-Coder-V2-Lite-Instruct-Q8_0-GGUF

16B • Updated Jul 22, 2024 • 4

pseelam/Mistral-7B-Instruct-v0.3-Q4_K_M-GGUF

7B • Updated Jul 22, 2024 • 6

shisahni/Meta-Llama-3-8B-Instruct-Q4_K_M-GGUF

Text Generation • 8B • Updated Jul 23, 2024 • 4

mjschock/open_llama_3b_v2-Q8_0-GGUF

3B • Updated Jul 23, 2024 • 4 • 1

nvhf/Triplex-Q6_K-GGUF

4B • Updated Jul 23, 2024 • 1

toktomo/Phi-3-medium-4k-instruct-Q8_0-GGUF

Text Generation • 14B • Updated Jul 23, 2024 • 2

biggesto/news_mistral_test-Q5_K_M-GGUF

7B • Updated Jul 23, 2024 • 2

elvispresniy/Qwen2-1.5B-Instruct-Q4_K_M-GGUF

Text Generation • 2B • Updated Jul 23, 2024 • 4

linxsxs/Meta-Llama-3-8B-Instruct-Q4_K_M-GGUF

Text Generation • 8B • Updated Jul 23, 2024

xsydorm00/Meta-Llama-3-8B-Q4_K_M-GGUF

Text Generation • 8B • Updated Jul 23, 2024 • 2