quantization / ext-torch / torch_binding.cpp

Commit History

Add full Marlin support and tests for Marlin/CUTLASS
165b25c

danieldk HF staff committed on

Add GPTQ-Marlin
c31b5ce

danieldk HF staff committed on

Add `scaled_(int|fp8)_quant` and `fp8_marlin_gemm`
5c6fb68

danieldk HF staff committed on

Add cutlass_w8a8
b4cad21

danieldk HF staff committed on