AI & ML interests

Optimised quants for high-throughput deployments! Compatible with Transformers, TGI & vLLM 🤗

hugging-quants 's collections 3

Llama 3.2 3B & 1B GGUF Quants
Llama.cpp compatible quants for Llama 3.2 3B and 1B Instruct models.
Llama 3.1 GPTQ, AWQ, and BNB Quants
Optimised Quants for high-throughput deployments! Compatible with Transformers, TGI & VLLM 🤗