The official prequantized EfficientQAT models.
- ChenMnZ/Llama-3-8b-instruct-EfficientQAT-w4g128 (Text Generation)
- ChenMnZ/Llama-3-8b-instruct-EfficientQAT-w2g64 (Text Generation)
- ChenMnZ/Llama-3-8b-instruct-EfficientQAT-w2g128 (Text Generation)
- ChenMnZ/Llama-3-8b-EfficientQAT-w4g128 (Text Generation)
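
The repository names in this list end in a suffix such as w4g128 or w2g64. A minimal sketch of how that suffix might be decoded, assuming it follows the common `w<bits>g<group size>` convention (weight bit-width, then quantization group size); the helper name `parse_quant_suffix` and the `QuantSpec` type are hypothetical and not part of any EfficientQAT tooling:

```python
import re
from typing import NamedTuple


class QuantSpec(NamedTuple):
    """Quantization settings inferred from a repository name (illustrative only)."""
    weight_bits: int
    group_size: int


# Assumption: the name ends in "-w<bits>g<group size>", e.g. "-w4g128".
_SUFFIX = re.compile(r"-w(\d+)g(\d+)$")


def parse_quant_suffix(repo_id: str) -> QuantSpec:
    """Extract weight bit-width and group size from the trailing suffix."""
    match = _SUFFIX.search(repo_id)
    if match is None:
        raise ValueError(f"no w<bits>g<group> suffix in {repo_id!r}")
    return QuantSpec(int(match.group(1)), int(match.group(2)))


# Repository IDs copied from the list above.
MODELS = [
    "ChenMnZ/Llama-3-8b-instruct-EfficientQAT-w4g128",
    "ChenMnZ/Llama-3-8b-instruct-EfficientQAT-w2g64",
    "ChenMnZ/Llama-3-8b-instruct-EfficientQAT-w2g128",
    "ChenMnZ/Llama-3-8b-EfficientQAT-w4g128",
]

for repo_id in MODELS:
    spec = parse_quant_suffix(repo_id)
    print(f"{repo_id}: {spec.weight_bits}-bit weights, group size {spec.group_size}")
```

Under that assumption, a smaller bit-width means a smaller checkpoint, and a smaller group size means more per-group quantization parameters are stored alongside the weights.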