bartowski commited on
Commit
f726cd1
1 Parent(s): e87da8b

measurement.json

Browse files
Files changed (2) hide show
  1. README.md +129 -0
  2. measurement.json +0 -0
README.md ADDED
@@ -0,0 +1,129 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ pipeline_tag: text-generation
3
+ inference: true
4
+ widget:
5
+ - text: 'def print_hello_world():'
6
+ example_title: Hello world
7
+ group: Python
8
+ datasets:
9
+ - bigcode/the-stack-v2-train
10
+ license: bigcode-openrail-m
11
+ library_name: transformers
12
+ tags:
13
+ - code
14
+ model-index:
15
+ - name: starcoder2-3b
16
+ results:
17
+ - task:
18
+ type: text-generation
19
+ dataset:
20
+ name: CruxEval-I
21
+ type: cruxeval-i
22
+ metrics:
23
+ - type: pass@1
24
+ value: 32.7
25
+ - task:
26
+ type: text-generation
27
+ dataset:
28
+ name: DS-1000
29
+ type: ds-1000
30
+ metrics:
31
+ - type: pass@1
32
+ value: 25.0
33
+ - task:
34
+ type: text-generation
35
+ dataset:
36
+ name: GSM8K (PAL)
37
+ type: gsm8k-pal
38
+ metrics:
39
+ - type: accuracy
40
+ value: 27.7
41
+ - task:
42
+ type: text-generation
43
+ dataset:
44
+ name: HumanEval+
45
+ type: humanevalplus
46
+ metrics:
47
+ - type: pass@1
48
+ value: 27.4
49
+ - task:
50
+ type: text-generation
51
+ dataset:
52
+ name: HumanEval
53
+ type: humaneval
54
+ metrics:
55
+ - type: pass@1
56
+ value: 31.7
57
+ - task:
58
+ type: text-generation
59
+ dataset:
60
+ name: RepoBench-v1.1
61
+ type: repobench-v1.1
62
+ metrics:
63
+ - type: edit-smiliarity
64
+ value: 71.19
65
+ quantized_by: bartowski
66
+ ---
67
+
68
+ ## Exllama v2 Quantizations of starcoder2-3b
69
+
70
+ Using <a href="https://github.com/turboderp/exllamav2/releases/tag/v0.0.19">turboderp's ExLlamaV2 v0.0.19</a> for quantization.
71
+
72
+ <b>The "main" branch only contains the measurement.json, download one of the other branches for the model (see below)</b>
73
+
74
+ Each branch contains an individual bits per weight, with the main one containing only the meaurement.json for further conversions.
75
+
76
+ Conversion was done using the default calibration dataset.
77
+
78
+ Default arguments used except when the bits per weight is above 6.0, at that point the lm_head layer is quantized at 8 bits per weight instead of the default 6.
79
+
80
+ Original model: https://huggingface.co/bigcode/starcoder2-3b
81
+
82
+
83
+ <a href="https://huggingface.co/bartowski/starcoder2-3b-exl2/tree/8_0">8.0 bits per weight</a>
84
+
85
+ <a href="https://huggingface.co/bartowski/starcoder2-3b-exl2/tree/6_5">6.5 bits per weight</a>
86
+
87
+ <a href="https://huggingface.co/bartowski/starcoder2-3b-exl2/tree/5_0">5.0 bits per weight</a>
88
+
89
+ <a href="https://huggingface.co/bartowski/starcoder2-3b-exl2/tree/4_25">4.25 bits per weight</a>
90
+
91
+ <a href="https://huggingface.co/bartowski/starcoder2-3b-exl2/tree/3_5">3.5 bits per weight</a>
92
+
93
+
94
+ ## Download instructions
95
+
96
+ With git:
97
+
98
+ ```shell
99
+ git clone --single-branch --branch 6_5 https://huggingface.co/bartowski/starcoder2-3b-exl2
100
+ ```
101
+
102
+ With huggingface hub (credit to TheBloke for instructions):
103
+
104
+ ```shell
105
+ pip3 install huggingface-hub
106
+ ```
107
+
108
+ To download the `main` (only useful if you only care about measurement.json) branch to a folder called `starcoder2-3b-exl2`:
109
+
110
+ ```shell
111
+ mkdir starcoder2-3b-exl2
112
+ huggingface-cli download bartowski/starcoder2-3b-exl2 --local-dir starcoder2-3b-exl2 --local-dir-use-symlinks False
113
+ ```
114
+
115
+ To download from a different branch, add the `--revision` parameter:
116
+
117
+ Linux:
118
+
119
+ ```shell
120
+ mkdir starcoder2-3b-exl2-6_5
121
+ huggingface-cli download bartowski/starcoder2-3b-exl2 --revision 6_5 --local-dir starcoder2-3b-exl2-6_5 --local-dir-use-symlinks False
122
+ ```
123
+
124
+ Windows (which apparently doesn't like _ in folders sometimes?):
125
+
126
+ ```shell
127
+ mkdir starcoder2-3b-exl2-6.5
128
+ huggingface-cli download bartowski/starcoder2-3b-exl2 --revision 6_5 --local-dir starcoder2-3b-exl2-6.5 --local-dir-use-symlinks False
129
+ ```
measurement.json ADDED
The diff for this file is too large to render. See raw diff