llama.cpp and ik_llama.cpp imatrix Quantizations of unsloth/phi-4-GGUF

Imatrix and quantizations created from https://huggingface.co/unsloth/phi-4-GGUF/phi-4-F16.gguf

Imatrix dataset from bartowski1182

llama.cpp

phi-4-IQ2_S.gguf
phi-4-IQ3_XS.gguf
phi-4-Q4_K_M.gguf
phi-4-IQ4_XS.gguf
phi-4-IQ4_NL.gguf

ik_llama.cpp

phi-4-IQ4_KS.gguf
phi-4-IQ4_NL_R4.gguf

Credits

llama.cpp, ik_llama.cpp, bartowski, microsoft, unsloth, huggingface

Downloads last month: 139

GGUF

Model size

14.7B params

Architecture

llama

2-bit

4-bit

View +1 file

Inference Providers NEW

This model is not currently available via any of the supported third-party Inference Providers, and HF Inference API was unable to determine this model's library.

Model tree for mcm07/phi-4-GGUF-imatrix

Base model

microsoft/phi-4

Quantized

(98)

this model