hugging-quants/Meta-Llama-3.1-405B-Instruct-AWQ-INT4 • Text Generation • Updated Sep 13, 2024 • 277 • 37
Llama 3.1 GPTQ, AWQ, and BNB Quants • Collection • Optimised Quants for high-throughput deployments! Compatible with Transformers, TGI & vLLM 🤗 • 9 items • Updated Sep 26, 2024 • 56