robertgshaw2 commited on
Commit
c3ddd3d
·
verified ·
1 Parent(s): f9ac629

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +1 -1
README.md CHANGED
@@ -17,4 +17,4 @@ Download our compression-aware inference engines and open source tools for fast
17
 
18
  ![image/png](https://cdn-uploads.huggingface.co/production/uploads/60466e4b4f40b01b66151416/2IDqpxbtCtw_ilOZbTSj0.png)
19
 
20
- In this profile we provide accurate model checkpoints compressed with SOTA methods ready to run in vLLM such as W4A16, W8A16, W8A8 (int8 and fp8), and many more!
 
17
 
18
  ![image/png](https://cdn-uploads.huggingface.co/production/uploads/60466e4b4f40b01b66151416/2IDqpxbtCtw_ilOZbTSj0.png)
19
 
20
+ In this profile we provide accurate model checkpoints compressed with SOTA methods ready to run in vLLM such as W4A16, W8A16, W8A8 (int8 and fp8), and many more! If you would like help quantizing a model or have a request for us to add a checkpoint, please open an issue in https://github.com/vllm-project/llm-compressor.