DaraV
/

LLaMA-3.1-8B-Instruct-INT4-GPTQ

4-bit precision

Model card Files Files and versions Community

LLaMA-3.1-8B Instruct model quantized to INT-4 using AutoGPTQ.

Downloads last month: 17

Safetensors

Model size

1.99B params

Tensor type

FP16

·

I32

·

Inference API

Unable to determine this model's library. Check the docs .

Model tree for DaraV/LLaMA-3.1-8B-Instruct-INT4-GPTQ

Base model

meta-llama/Llama-3.1-8B

Finetuned

meta-llama/Llama-3.1-8B-Instruct

Quantized

(313)

this model