Quantized QwQ 32B
Collection
6 items
•
Updated
This is Qwen/QwQ-32B quantized with AutoRound (symmetric quantization) and serialized with the GPTQ format in 4-bit. The model has been created, tested, and evaluated by The Kaitchup. The model is compatible with vLLM and Transformers.
Details on the quantization process and how to use the model here: The Kaitchup
Subscribe to The Kaitchup. This helps me a lot to continue quantizing and evaluating models for free.