Add perplexity data.
Browse files
README.md
CHANGED
@@ -11,3 +11,26 @@ For use with [llama.cpp](https://github.com/ggerganov/llama.cpp).
|
|
11 |
- Model: [openlm-research/open_llama_7b](https://huggingface.co/openlm-research/open_llama_7b)
|
12 |
- llama.cpp 4,5,8-bit quantization: build 567(2d5db48) or later
|
13 |
- llama.cpp newer quantization formats: build 616(99009e7) or later
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
11 |
- Model: [openlm-research/open_llama_7b](https://huggingface.co/openlm-research/open_llama_7b)
|
12 |
- llama.cpp 4,5,8-bit quantization: build 567(2d5db48) or later
|
13 |
- llama.cpp newer quantization formats: build 616(99009e7) or later
|
14 |
+
|
15 |
+
## Perplexity
|
16 |
+
|
17 |
+
Calculated with llama.cpp, default settings (context 512, batch 512).
|
18 |
+
Test data: [`wiki.test.raw` of WikiText-103](https://blog.salesforceairesearch.com/the-wikitext-long-term-dependency-language-modeling-dataset/):
|
19 |
+
|
20 |
+
| model | score |
|
21 |
+
| ------------------------- | -----: |
|
22 |
+
| open-llama-7b-q2_K.bin | 8.5152 |
|
23 |
+
| open-llama-7b-q3_K_S.bin | 7.6623 |
|
24 |
+
| open-llama-7b-q3_K.bin | 7.3837 |
|
25 |
+
| open-llama-7b-q3_K_L.bin | 7.3043 |
|
26 |
+
| open-llama-7b-q4_0.bin | 7.2116 |
|
27 |
+
| open-llama-7b-q4_1.bin | 7.1609 |
|
28 |
+
| open-llama-7b-q4_K_S.bin | 7.1516 |
|
29 |
+
| open-llama-7b-q4_K.bin | 7.1116 |
|
30 |
+
| open-llama-7b-q5_0.bin | 7.0353 |
|
31 |
+
| open-llama-7b-q5_K_S.bin | 7.0325 |
|
32 |
+
| open-llama-7b-q5_1.bin | 7.0318 |
|
33 |
+
| open-llama-7b-q5_K.bin | 7.0272 |
|
34 |
+
| open-llama-7b-q6_K.bin | 7.0050 |
|
35 |
+
| open-llama-7b-q8_0.bin | 6.9968 |
|
36 |
+
| open-llama-7b-f16.bin | 6.9966 |
|