Update README.md
README.md
@@ -8,8 +8,10 @@ base_model: [deepseek-ai/DeepSeek-Coder-V2-Instruct]
 ### While it required custom code to make, it is standard compatible with plain llama.cpp from github or just search nisten in lmstudio
 
 >[!TIP]
->The following 4bit version is the one I use myself, it gets 17tps on 64 arm cores
->
+>The following 4bit version is the one I use myself; it gets 17 tps on 64 ARM cores.
+>
+>You don't need to consolidate the files anymore; just point llama-cli at the first one and it will handle the rest.
+>
 >Then to run in commandline interactive mode (prompt.txt file is optional) just do:
 >```c++
 >./llama-cli --temp 0.4 -m deepseek_coder_v2_cpu_iq4xm.gguf-00001-of-00004.gguf -c 32000 -co -cnv -i -f prompt.txt
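Since the quant ships as four split files and llama-cli only needs the first one, a quick pre-flight check is to list the parts that should be on disk. A minimal sketch; the filename pattern is assumed from the command above (llama.cpp's standard `<base>-00001-of-0000N.gguf` split naming):

```shell
# List the expected split-part filenames so you can verify the download is
# complete before pointing llama-cli at the first part.
# (Pattern assumed from the command in the README: <base>-%05d-of-%05d.gguf)
BASE=deepseek_coder_v2_cpu_iq4xm.gguf
N=4
for i in $(seq 1 "$N"); do
  printf '%s-%05d-of-%05d.gguf\n' "$BASE" "$i" "$N"
done
```

Piping the output through `xargs ls -l` will immediately flag any part that is missing from the current directory.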