Gemma-2-2B-it-4Bit-GPTQ

Quantization

This model was quantized with the Auto-GPTQ library, using a calibration dataset of English and Russian Wikipedia articles. It has lower perplexity on Russian data than other GPTQ models.
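For readers unfamiliar with the metric, perplexity is the exponential of the mean negative log-likelihood per token, so lower is better. The sketch below only illustrates that formula. The function name and the per-token probabilities are hypothetical and are not measurements from this model or any other.

```python
import math


def perplexity(token_log_probs):
    """Return exp(mean negative natural-log likelihood) over the tokens."""
    if not token_log_probs:
        raise ValueError("need at least one token log-probability")
    mean_nll = -sum(token_log_probs) / len(token_log_probs)
    return math.exp(mean_nll)


# Hypothetical log-probabilities that two models assign to the same three tokens.
model_a = [math.log(0.5), math.log(0.25), math.log(0.5)]
model_b = [math.log(0.25), math.log(0.25), math.log(0.25)]

# The model that assigns higher probability to the text gets lower perplexity.
print(perplexity(model_a) < perplexity(model_b))
```

In practice the log-probabilities would come from running each quantized model over the same held-out Russian text with the same tokenizer, so that the scores are comparable.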

Safetensors

Model size: 861M params

Tensor type: I32, FP16
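The I32 tensor type reflects how 4-bit GPTQ checkpoints are usually stored: eight 4-bit weights are packed into each 32-bit integer, while scales and unquantized layers stay in FP16. This would also be consistent with the reported parameter count being lower than the base model's, since packed words are counted rather than individual weights. The sketch below shows the packing idea only. The helper names and the lowest-nibble-first ordering are assumptions for illustration, not Auto-GPTQ's exact layout.

```python
def pack_int4(values):
    """Pack eight 4-bit integers (0..15) into one 32-bit word, lowest nibble first."""
    if len(values) != 8:
        raise ValueError("expected exactly eight values")
    word = 0
    for i, v in enumerate(values):
        if not 0 <= v <= 15:
            raise ValueError("each value must fit in 4 bits")
        word |= v << (4 * i)  # place value i in bits 4*i .. 4*i+3
    return word


def unpack_int4(word):
    """Recover the eight 4-bit integers from a packed 32-bit word."""
    return [(word >> (4 * i)) & 0xF for i in range(8)]


weights = [1, 2, 3, 4, 5, 6, 7, 8]
packed = pack_int4(weights)
print(unpack_int4(packed) == weights)
```

Python integers are unbounded, so the packed word here is treated as unsigned; a real checkpoint stores the same bit pattern in a signed int32 tensor.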

