Update README.md
README.md CHANGED
@@ -24,7 +24,7 @@ It was trained by doing supervised fine-tuning over a mixture of regular instruc
 
 The current Metharme-13b has been trained as a LoRA, then merged down to the base model for distribution.
 
-It has also been quantized down to
+It has also been quantized down to 8Bit using the GPTQ library available here: https://github.com/0cc4m/GPTQ-for-LLaMa
 ```
 python llama.py .\TehVenom_Metharme-13b-Merged c4 --wbits 8 --act-order --save_safetensors Metharme-13b-GPTQ-8bit.act-order.safetensors
 ```