bgsach commited on
Commit
16bf712
·
1 Parent(s): 4e4b19f

Upload folder using huggingface_hub

Browse files
Files changed (4) hide show
  1. README.md +19 -0
  2. config.json +6 -0
  3. model.bin +3 -0
  4. vocabulary.json +0 -0
README.md ADDED
@@ -0,0 +1,19 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+
2
+ ---
3
+ license: llama2
4
+ tags:
5
+ - code
6
+ ---
7
+
8
+ This is a quantized version of **WizardLM/WizardCoder-Python-7B-V1.0**, quantized using [ctranslate2](https://github.com/OpenNMT/CTranslate2) (see inference instructions there).
9
+
10
+ **The license/caveats/intended usage is the same as the original model**.
11
+ The quality of its output may have
12
+ been negatively affected by the quantization process.
13
+
14
+
15
+ The command run to quantize the model was:
16
+
17
+ `ct2-transformers-converter --model ./models-hf/WizardLM/WizardCoder-Python-7B-V1.0 --quantization float16 --output_dir ./models-ct/WizardLM/WizardCoder-Python-7B-V1.0-ct2-float16`
18
+
19
+ The quantization was run on a 'high-mem', CPU only (8 core, 51GB) colab instance and took approximately 10 minutes.
config.json ADDED
@@ -0,0 +1,6 @@
 
 
 
 
 
 
 
1
+ {
2
+ "bos_token": "</s>",
3
+ "eos_token": "</s>",
4
+ "layer_norm_epsilon": 1e-05,
5
+ "unk_token": "</s>"
6
+ }
model.bin ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:c88b6ed131dcb3f0f4c3763018e0557dcff933ec2b658e298d3f2a4fa3671a90
3
+ size 13476866371
vocabulary.json ADDED
The diff for this file is too large to render. See raw diff