FPHam
/

Karen_theEditor_13b_HF

Text Generation

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

FPHam commited on Jun 3, 2023

Commit

6d872e4

•

1 Parent(s): 5055c01

Update README.md

Files changed (1) hide show

README.md +1 -1

README.md CHANGED Viewed

@@ -10,7 +10,7 @@ Based on LLAMA 13b and Wizard-Vucna-uncensored finetune, then finetuned with abo
 ## Quantized version (Quantized by TheBloke)
-* [4-bit GPTQ models for GPU inference](https://huggingface.co/TheBloke/Karen_theEditor_13B-GPTQ)
 * [4-bit, 5-bit and 8-bit GGML models for CPU(+GPU) inference](https://huggingface.co/TheBloke/Karen_theEditor_13B-GGML)
 Karen gets triggered by this prompt (pun intended):

 ## Quantized version (Quantized by TheBloke)
+* [4-bit GPTQ models for GPU inference](https://huggingface.co/FPHam/Karen_theEditor-13B-4bit-128g-GPTQ)
 * [4-bit, 5-bit and 8-bit GGML models for CPU(+GPU) inference](https://huggingface.co/TheBloke/Karen_theEditor_13B-GGML)
 Karen gets triggered by this prompt (pun intended):