R136a1 commited on
Commit
a6a8fdb
1 Parent(s): 8ab7e3a

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +2 -2
README.md CHANGED
@@ -13,13 +13,13 @@ language:
13
  ## Model details
14
 
15
  First attempt to quantize a 20B model so it can run on 16GB VRAM with the highest quality possible.
16
- Quantized at 3.23bpw with hb 6
17
 
18
  Perplexity:
19
 
20
  Base = 6.4744
21
 
22
- 3.23 h6 = 6.5369
23
 
24
  Dataset = [wikitext](https://huggingface.co/datasets/wikitext/resolve/refs%2Fconvert%2Fparquet/wikitext-2-v1/test/0000.parquet)
25
 
 
13
  ## Model details
14
 
15
  First attempt to quantize a 20B model so it can run on 16GB VRAM with the highest quality possible.
16
+ Quantized at 3.18bpw with hb 6
17
 
18
  Perplexity:
19
 
20
  Base = 6.4744
21
 
22
+ 3.18 h6 = 6.5705
23
 
24
  Dataset = [wikitext](https://huggingface.co/datasets/wikitext/resolve/refs%2Fconvert%2Fparquet/wikitext-2-v1/test/0000.parquet)
25