chatglm-6b-int8 / quantization.py

Commit History

Add support for parallel quantization on Mac
a697125

duzx16 commited on

Remove assert in load_cpu_kernel
3218e92

duzx16 commited on

Sync with chatglm-6b
216185d

duzx16 commited on

Init commit
fb85b4d

duzx16 commited on