Text Generation
Transformers
PyTorch
English
Chinese
llama
code
text-generation-inference

Dataset approach to GPTQ quantization

#1
by KrisPi - opened

Hi,

Would you be so kind as to share the approach to such lossless quantization? Have you used the same high-quality instructions that were used in fine-tuning?

Best regards!

CodeFuse AI org

Hi,
Yes, we used a small subset of samples sampled from the fine-tuning dataset for quantization.

codefuse-admin changed discussion status to closed
Your need to confirm your account before you can post a new comment.

Sign up or log in to comment