Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

kernels-community
/
quantization

kernel
Model card Files Files and versions Community
1
quantization / cutlass_w8a8 /c3x
Ctrl+K
Ctrl+K
  • 2 contributors
History: 2 commits
danieldk's picture
danieldk HF Staff
Sync to vLLM 20250627
8aa00a3 11 days ago
  • cutlass_gemm_caller.cuh
    3.98 kB
    Sync on vLLM 20240402 3 months ago
  • scaled_mm.cuh
    5.39 kB
    Sync on vLLM 20240402 3 months ago
  • scaled_mm_azp_sm90_int8.cu
    1.04 kB
    Sync on vLLM 20240402 3 months ago
  • scaled_mm_blockwise_sm100_fp8.cu
    862 Bytes
    Sync to vLLM 20250627 11 days ago
  • scaled_mm_blockwise_sm100_fp8_dispatch.cuh
    11.5 kB
    Sync to vLLM 20250627 11 days ago
  • scaled_mm_blockwise_sm90_fp8.cu
    854 Bytes
    Sync to vLLM 20250627 11 days ago
  • scaled_mm_blockwise_sm90_fp8_dispatch.cuh
    7.71 kB
    Sync on vLLM 20240402 3 months ago
  • scaled_mm_helper.hpp
    3.54 kB
    Sync to vLLM 20250627 11 days ago
  • scaled_mm_kernels.hpp
    2.28 kB
    Sync to vLLM 20250627 11 days ago
  • scaled_mm_sm100_fp8.cu
    980 Bytes
    Sync on vLLM 20240402 3 months ago
  • scaled_mm_sm100_fp8_dispatch.cuh
    5.6 kB
    Sync to vLLM 20250627 11 days ago
  • scaled_mm_sm90_fp8.cu
    972 Bytes
    Sync on vLLM 20240402 3 months ago
  • scaled_mm_sm90_fp8_dispatch.cuh
    4.69 kB
    Sync on vLLM 20240402 3 months ago
  • scaled_mm_sm90_int8.cu
    980 Bytes
    Sync on vLLM 20240402 3 months ago
  • scaled_mm_sm90_int8_dispatch.cuh
    6.52 kB
    Sync on vLLM 20240402 3 months ago