disinfozone
commited on
Update README.md
Browse files
README.md
CHANGED
@@ -45,7 +45,16 @@ You can try other similar prompts, we've had success with them, but this remains
|
|
45 |
|
46 |
## GGUFs
|
47 |
|
48 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
49 |
|
50 |
## How to Run
|
51 |
|
|
|
45 |
|
46 |
## GGUFs
|
47 |
|
48 |
+
Typically I like Q5_K_M or Q8_0. You get better quality running the highest quant you can, especially with these small models. I haven't bothered with quants smaller than Q4.
|
49 |
+
|
50 |
+
| Name | Quant method | Bits | Size | Max RAM required | Use case |
|
51 |
+
| ---- | ---- | ---- | ---- | ---- | ----- |
|
52 |
+
| [Disinfo4_mistral-ft-optimized-1218.Q4_K_S.gguf](https://huggingface.co/disinfozone/Disinfo4_mistral-ft-optimized-1218_GGUF/blob/main/Disinfo4_mistral-ft-optimized-1218.Q4_K_S.gguf) | Q4_K_S | 4 | 4.14 GB| 6.64 GB | small, greater quality loss |
|
53 |
+
| [Disinfo4_mistral-ft-optimized-1218.Q4_K_M.gguf](https://huggingface.co/disinfozone/Disinfo4_mistral-ft-optimized-1218_GGUF/blob/main/Disinfo4_mistral-ft-optimized-1218.Q4_K_M.gguf) | Q4_K_M | 4 | 4.37 GB| 6.87 GB | medium, balanced quality - recommended |
|
54 |
+
| [Disinfo4_mistral-ft-optimized-1218.Q5_K_S.gguf](https://huggingface.co/disinfozone/Disinfo4_mistral-ft-optimized-1218_GGUF/blob/main/Disinfo4_mistral-ft-optimized-1218.Q5_K_S.gguf) | Q5_K_S | 5 | 5.00 GB| 7.50 GB | large, low quality loss - recommended |
|
55 |
+
| [disinfo4_mistral-ft-optimized-1218.Q5_K_M.gguf](https://huggingface.co/disinfozone/Disinfo4_mistral-ft-optimized-1218_GGUF/blob/main/disinfo4_mistral-ft-optimized-1218.Q5_K_M.gguf) | Q5_K_M | 5 | 5.13 GB| 7.63 GB | large, very low quality loss - recommended |
|
56 |
+
| [Disinfo4_mistral-ft-optimized-1218.Q6_K.gguf](https://huggingface.co/disinfozone/Disinfo4_mistral-ft-optimized-1218_GGUF/blob/main/Disinfo4_mistral-ft-optimized-1218.Q6_K.gguf) | Q6_K | 6 | 5.94 GB| 8.44 GB | very large, extremely low quality loss |
|
57 |
+
| [disinfo4_mistral-ft-optimized-1218.gguf](https://huggingface.co/disinfozone/Disinfo4_mistral-ft-optimized-1218_GGUF/blob/main/disinfo4_mistral-ft-optimized-1218.Q8_0.gguf) | Q8_0 | 8 | 7.70 GB| 10.20 GB | very large, extremely low quality loss - not recommended |
|
58 |
|
59 |
## How to Run
|
60 |
|