llama-7b-4bit-gr128 / README.md
Generated with: `--wbits 4 --groupsize 128 --true-sequential --new-eval --faster-kernel`
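
To make the first two flags concrete, here is a rough sketch of what 4-bit weights with a group size of 128 mean for storage: every run of 128 weights shares one scale and one offset, and each weight is stored as an integer code in the range 0 to 15. This is an illustration only. It uses plain round-to-nearest quantization, not the GPTQ procedure that produced this checkpoint (GPTQ additionally compensates for quantization error while it quantizes), and it does not model `--true-sequential`, `--new-eval`, or `--faster-kernel`. All function names below are hypothetical.

```python
import random

WBITS = 4        # mirrors --wbits 4
GROUPSIZE = 128  # mirrors --groupsize 128


def quantize_group(weights, wbits=WBITS):
    """Round-to-nearest quantization of one group (NOT GPTQ; illustration only)."""
    levels = (1 << wbits) - 1                      # 15 distinct steps for 4 bits
    lo, hi = min(weights), max(weights)
    scale = (hi - lo) / levels if hi > lo else 1.0  # avoid division by zero
    codes = [round((w - lo) / scale) for w in weights]
    return codes, scale, lo


def quantize_row(row, groupsize=GROUPSIZE, wbits=WBITS):
    """Split a weight row into groups; each group gets its own scale and offset."""
    return [
        quantize_group(row[start:start + groupsize], wbits)
        for start in range(0, len(row), groupsize)
    ]


def dequantize_row(groups):
    """Rebuild approximate float weights from the integer codes."""
    out = []
    for codes, scale, lo in groups:
        out.extend(c * scale + lo for c in codes)
    return out


# Hypothetical data: one row of 512 small random weights.
random.seed(0)
row = [random.gauss(0.0, 0.02) for _ in range(512)]

groups = quantize_row(row)
restored = dequantize_row(groups)
max_err = max(abs(a - b) for a, b in zip(row, restored))
print(f"{len(groups)} groups, max reconstruction error {max_err:.6f}")
```

A smaller group size gives each scale fewer weights to cover, which tends to lower the reconstruction error at the cost of storing more scales and offsets.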