Update README.md
README.md CHANGED
@@ -27,12 +27,18 @@ Unless you are able to use the latest GPTQ-for-LLaMa code, please use `medalpaca
 * `medalpaca-13B-GPTQ-4bit-128g.no-act-order.safetensors`
   * Created with the latest GPTQ-for-LLaMa code
   * Parameters: Groupsize = 128g. No act-order.
-  * Command:
+  * Command:
+    ```
+    CUDA_VISIBLE_DEVICES=0 python3 llama.py medalpaca-13b c4 --wbits 4 --true-sequential --groupsize 128 --save_safetensors medalpaca-13B-GPTQ-4bit-128g.no-act-order.safetensors
+    ```
 * `medalpaca-13B-GPTQ-4bit-128g.safetensors`
   * Created with the latest GPTQ-for-LLaMa code
   * Parameters: Groupsize = 128g. act-order.
   * Offers highest quality quantisation, but requires recent GPTQ-for-LLaMa code
-  * Command:
+  * Command:
+    ```
+    CUDA_VISIBLE_DEVICES=0 python3 llama.py medalpaca-13b c4 --wbits 4 --true-sequential --act-order --groupsize 128 --save_safetensors medalpaca-13B-GPTQ-4bit-128g.safetensors
+    ```

 ## How to run in `text-generation-webui`
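
As context for the commands added above: `llama.py` is the quantisation script from the GPTQ-for-LLaMa project, which the README names as the tool used. The sketch below shows how the no-act-order command might be run end to end; the clone URL, the `pip install` step, and the assumption that the unquantised `medalpaca-13b` weights sit in the working directory are not stated in the diff and are illustrative assumptions only.

```
# Sketch only: assumes llama.py comes from the qwopqwop200/GPTQ-for-LLaMa repo
# and that the unquantised medalpaca-13b weights are available locally.
git clone https://github.com/qwopqwop200/GPTQ-for-LLaMa
cd GPTQ-for-LLaMa
pip install -r requirements.txt

# The two commands in the README differ only in the --act-order flag;
# per the README, act-order gives the highest-quality quantisation but
# requires recent GPTQ-for-LLaMa code.
CUDA_VISIBLE_DEVICES=0 python3 llama.py medalpaca-13b c4 \
  --wbits 4 --true-sequential --groupsize 128 \
  --save_safetensors medalpaca-13B-GPTQ-4bit-128g.no-act-order.safetensors
```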