TheBloke/stable-vicuna-13B-GGML


TheBloke committed on Apr 29, 2023

Commit 4d30683 • 1 Parent(s): 3b1060b

Update README.md

Files changed (1)

README.md +2 -7


@@ -67,17 +67,12 @@ Don't expect any third-party UIs/tools to support them yet.
 I use the following command line; adjust for your tastes and needs:
-```
-./main -t 18 -m stable-vicuna-13B.ggml.q4_2.bin --color -c 2048 --temp 0.7 --repeat_penalty 1.1 -n -1 -r "### Human:" -i
-```
-Change `-t 18` to the number of physical CPU cores you have. For example if your system has 8 cores/16 threads, use `-t 8`.
-If you want to enter a prompt from the command line, use `-p <PROMPT>` like so:
 ```
 ./main -t 18 -m stable-vicuna-13B.ggml.q4_2.bin --color -c 2048 --temp 0.7 --repeat_penalty 1.1 -n -1 -r "### Human:" -p "### Human: write a story about llamas ### Assistant:"
 ```
 ## How to run in `text-generation-webui`
 GGML models can be loaded into text-generation-webui by installing the llama.cpp module, then placing the ggml model file in a model folder as usual.

 I use the following command line; adjust for your tastes and needs:
 ```
 ./main -t 18 -m stable-vicuna-13B.ggml.q4_2.bin --color -c 2048 --temp 0.7 --repeat_penalty 1.1 -n -1 -r "### Human:" -p "### Human: write a story about llamas ### Assistant:"
 ```
+Change `-t 18` to the number of physical CPU cores you have. For example if your system has 8 cores/16 threads, use `-t 8`.
 ## How to run in `text-generation-webui`
 GGML models can be loaded into text-generation-webui by installing the llama.cpp module, then placing the ggml model file in a model folder as usual.
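
The README line moved by this commit says to set `-t` to the number of physical CPU cores rather than threads. As a rough sketch of how that number could be found on Linux, the helper below counts unique core/socket pairs in the output of `lscpu -p=Core,Socket`. The function name `count_physical_cores` is hypothetical and not part of llama.cpp, and the approach assumes a system where `lscpu` (util-linux) is available.

```shell
# Hypothetical helper, not part of llama.cpp: counts physical cores from
# `lscpu -p=Core,Socket` style output read on stdin.
# Each non-comment line is "core,socket"; hyperthreads on the same core
# repeat the same pair, so counting unique pairs gives physical cores.
count_physical_cores() {
  grep -Ev '^(#|$)' | sort -u | wc -l | tr -d ' '
}

# Example use (assumes Linux with lscpu installed):
#   ./main -t "$(lscpu -p=Core,Socket | count_physical_cores)" -m stable-vicuna-13B.ggml.q4_2.bin ...
```

On an 8 core/16 thread machine this should yield 8, matching the `-t 8` example in the README. On other platforms the count has to come from a different tool.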