kernels-community/quantization
kernel
License: apache-2.0
Directory: quantization/fp8 (at commit 5c6fb68)
2 contributors. History: 1 commit
Latest commit: 5c6fb68 by danieldk (HF staff), 4 months ago: Add `scaled_(int|fp8)_quant` and `fp8_marlin_gemm`
Name            Size      Scan
amd/            -         -
nvidia/         -         -
common.cu       5.71 kB   Safe
common.cuh      5.69 kB   -
fp8_marlin.cu   51.1 kB   Safe

Last commit for every entry: Add `scaled_(int|fp8)_quant` and `fp8_marlin_gemm` (4 months ago)