Update README.md
Browse files
README.md
CHANGED
@@ -5,6 +5,7 @@ inference: false
|
|
5 |
|
6 |
**NOTE: This GGML conversion is primarily for use with llama.cpp.**
|
7 |
- 13B parameters
|
|
|
8 |
- Based on version 1.1
|
9 |
- Used PR "More accurate Q4_0 and Q4_1 quantizations #896" (should be closer in quality to unquantized)
|
10 |
|
|
|
5 |
|
6 |
**NOTE: This GGML conversion is primarily for use with llama.cpp.**
|
7 |
- 13B parameters
|
8 |
+
- 4-bit quantized
|
9 |
- Based on version 1.1
|
10 |
- Used PR "More accurate Q4_0 and Q4_1 quantizations #896" (should be closer in quality to unquantized)
|
11 |
|