LLaMA-3.1-8B Instruct model quantized to INT-4 using AutoGPTQ.

Downloads last month
17
Safetensors
Model size
1.99B params
Tensor type
FP16
·
I32
·
Inference API
Unable to determine this model's library. Check the docs .

Model tree for DaraV/LLaMA-3.1-8B-Instruct-INT4-GPTQ

Quantized
(313)
this model