kernels-community/quantization
kernel
License: apache-2.0
88883b3
quantization/gptq_marlin
2 contributors
History: 2 commits
danieldk (HF staff): Add GPTQ-Marlin (c31b5ce), 4 months ago
File                    Size     Last commit message                                 Updated
awq_marlin_repack.cu    8.79 kB  Add GPTQ-Marlin                                     4 months ago
gptq_marlin.cu          95.1 kB  Add GPTQ-Marlin                                     4 months ago
gptq_marlin_repack.cu   11.1 kB  Add GPTQ-Marlin                                     4 months ago
marlin.cuh              2.27 kB  Add `scaled_(int|fp8)_quant` and `fp8_marlin_gemm`  4 months ago
marlin_dtypes.cuh       1.99 kB  Add `scaled_(int|fp8)_quant` and `fp8_marlin_gemm`  4 months ago