Safetensors
llama
falcon3
4-bit precision
gptq

Commit History

(fix): align quantized models tokenizers with un-quantized ones
b753f4c

ybelkada committed on