akumaburn commited on
Commit
bd2a640
·
verified ·
1 Parent(s): 9afe5a7

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +8 -1
README.md CHANGED
@@ -34,11 +34,18 @@ datasets:
34
 
35
  Some GGUF quantizations are included as well.
36
 
 
 
 
 
 
37
  Open_Orca_Llama-3-8B-unsloth.Q8_0.gguf:
38
  - **MMLU-Test:** Final result: 39.3818 +/- 0.4138
 
39
 
40
  Meta-Llama-3-8B.Q8_0.gguf:
41
- - **MMLU-Test:** Pending..
 
42
 
43
  Llama.cpp Options For Testing:
44
  --samplers "tfs;typical;temp" --draft 32 --ctx-size 8192 --temp 0.82 --tfs 0.8 --typical 1.1 --repeat-last-n 512 --batch-size 8192 --repeat-penalty 1.0 --n-gpu-layers 100 --threads 12
 
34
 
35
  Some GGUF quantizations are included as well.
36
 
37
+
38
+ llama-3-8b-bnb-4bit.Q8_0.gguf:
39
+ - **MMLU-Test:** Pending..
40
+ - **Arc-Easy:** Pending..
41
+
42
  Open_Orca_Llama-3-8B-unsloth.Q8_0.gguf:
43
  - **MMLU-Test:** Final result: 39.3818 +/- 0.4138
44
+ - **Arc-Easy:** Final result: 67.3684 +/- 1.9656
45
 
46
  Meta-Llama-3-8B.Q8_0.gguf:
47
+ - **MMLU-Test:** Final result: 40.8664 +/- 0.4163
48
+ - **Arc-Easy:** Final result: 74.3860 +/- 1.8299
49
 
50
  Llama.cpp Options For Testing:
51
  --samplers "tfs;typical;temp" --draft 32 --ctx-size 8192 --temp 0.82 --tfs 0.8 --typical 1.1 --repeat-last-n 512 --batch-size 8192 --repeat-penalty 1.0 --n-gpu-layers 100 --threads 12