Llama2 quantized 4bit model with bitsandbytes.

Downloads last month
10
Safetensors
Model size
3.6B params
Tensor type
F32
FP16
U8
Inference Providers NEW
Inference Providers available for this model are disabled. Settings

Model tree for corneille97/llama-2-7b-4bits-turbo

Quantizations
1 model