Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
kernels-community
/
quantization
like
0
Follow
kernels-community
29
kernel
License:
apache-2.0
Model card
Files
Files and versions
Community
1
0da5bf5
quantization
/
fp8
2 contributors
History:
2 commits
danieldk
HF staff
Sync with vLLM
0da5bf5
2 months ago
amd
Add `scaled_(int|fp8)_quant` and `fp8_marlin_gemm`
4 months ago
nvidia
Add `scaled_(int|fp8)_quant` and `fp8_marlin_gemm`
4 months ago
common.cu
Safe
5.71 kB
Add `scaled_(int|fp8)_quant` and `fp8_marlin_gemm`
4 months ago
common.cuh
Safe
5.63 kB
Sync with vLLM
2 months ago
fp8_marlin.cu
Safe
51.1 kB
Add `scaled_(int|fp8)_quant` and `fp8_marlin_gemm`
4 months ago