milkowski commited on
Commit
53152a2
·
verified ·
1 Parent(s): 55f9ff1

Quantized models

Browse files
.gitattributes CHANGED
@@ -33,3 +33,9 @@ saved_model/**/* filter=lfs diff=lfs merge=lfs -text
33
  *.zip filter=lfs diff=lfs merge=lfs -text
34
  *.zst filter=lfs diff=lfs merge=lfs -text
35
  *tfevents* filter=lfs diff=lfs merge=lfs -text
 
 
 
 
 
 
 
33
  *.zip filter=lfs diff=lfs merge=lfs -text
34
  *.zst filter=lfs diff=lfs merge=lfs -text
35
  *tfevents* filter=lfs diff=lfs merge=lfs -text
36
+ deepseek-coder-7b-instruct-v1.5.f16.gguf filter=lfs diff=lfs merge=lfs -text
37
+ deepseek-coder-7b-instruct-v1.5.q4_0.gguf filter=lfs diff=lfs merge=lfs -text
38
+ deepseek-coder-7b-instruct-v1.5.q4_k.gguf filter=lfs diff=lfs merge=lfs -text
39
+ deepseek-coder-7b-instruct-v1.5.q5_k.gguf filter=lfs diff=lfs merge=lfs -text
40
+ deepseek-coder-7b-instruct-v1.5.q6_k.gguf filter=lfs diff=lfs merge=lfs -text
41
+ deepseek-coder-7b-instruct-v1.5.q8_0.gguf filter=lfs diff=lfs merge=lfs -text
LICENSE ADDED
File without changes
Modelfile ADDED
@@ -0,0 +1,7 @@
 
 
 
 
 
 
 
 
1
+ FROM model.gguf
2
+ TEMPLATE """{{ .System }}
3
+ ### Instruction:
4
+ {{ .Prompt }}
5
+ ### Response:
6
+ """
7
+ SYSTEM """You are an AI programming assistant, utilizing the Deepseek Coder model, developed by Deepseek Company, and you only answer questions related to computer science. For politically sensitive questions, security and privacy issues, and other non-computer science questions, you will refuse to answer."""
README.md ADDED
@@ -0,0 +1,55 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: other
3
+ license_name: deepseek
4
+ license_link: LICENSE
5
+ ---
6
+
7
+ <p align="center">
8
+ <img width="1000px" alt="DeepSeek Coder" src="https://github.com/deepseek-ai/DeepSeek-Coder/blob/main/pictures/logo.png?raw=true">
9
+ </p>
10
+ <p align="center"><a href="https://www.deepseek.com/">[🏠Homepage]</a> | <a href="https://coder.deepseek.com/">[🤖 Chat with DeepSeek Coder]</a> | <a href="https://discord.gg/Tc7c45Zzu5">[Discord]</a> | <a href="https://github.com/guoday/assert/blob/main/QR.png?raw=true">[Wechat(微信)]</a> </p>
11
+ <hr>
12
+
13
+
14
+
15
+
16
+ ### 1. Introduction of Deepseek-Coder-7B-Instruct v1.5
17
+
18
+ Deepseek-Coder-7B-Instruct-v1.5 is continue pre-trained from Deepseek-LLM 7B on 2T tokens by employing a window size of 4K and next token prediction objective, and then fine-tuned on 2B tokens of instruction data.
19
+
20
+ - **Home Page:** [DeepSeek](https://deepseek.com/)
21
+ - **Repository:** [deepseek-ai/deepseek-coder](https://github.com/deepseek-ai/deepseek-coder)
22
+ - **Chat With DeepSeek Coder:** [DeepSeek-Coder](https://coder.deepseek.com/)
23
+
24
+
25
+
26
+ ### 2. Evaluation Results
27
+ <img width="1000px" alt="DeepSeek Coder" src="https://cdn-uploads.huggingface.co/production/uploads/6538815d1bdb3c40db94fbfa/xOtCTW5xdoLCKY4FR6tri.png">
28
+
29
+
30
+
31
+ ### 3. How to Use
32
+ Here give some examples of how to use our model.
33
+ #### Chat Model Inference
34
+ ```python
35
+ from transformers import AutoTokenizer, AutoModelForCausalLM
36
+ tokenizer = AutoTokenizer.from_pretrained("deepseek-ai/deepseek-coder-7b-instruct-v1.5", trust_remote_code=True)
37
+ model = AutoModelForCausalLM.from_pretrained("deepseek-ai/deepseek-coder-7b-instruct-v1.5", trust_remote_code=True).cuda()
38
+ messages=[
39
+ { 'role': 'user', 'content': "write a quick sort algorithm in python."}
40
+ ]
41
+ inputs = tokenizer.apply_chat_template(messages, add_generation_prompt=True, return_tensors="pt").to(model.device)
42
+
43
+ outputs = model.generate(inputs, max_new_tokens=512, do_sample=False, top_k=50, top_p=0.95, num_return_sequences=1, eos_token_id=tokenizer.eos_token_id)
44
+ print(tokenizer.decode(outputs[0][len(inputs[0]):], skip_special_tokens=True))
45
+ ```
46
+
47
+ ### 4. License
48
+ This code repository is licensed under the MIT License. The use of DeepSeek Coder models is subject to the Model License. DeepSeek Coder supports commercial use.
49
+
50
+ See the [LICENSE-MODEL](https://github.com/deepseek-ai/deepseek-coder/blob/main/LICENSE-MODEL) for more details.
51
+
52
+ ### 5. Contact
53
+
54
+ If you have any questions, please raise an issue or contact us at [[email protected]](mailto:[email protected]).
55
+
deepseek-coder-7b-instruct-v1.5.f16.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:f4399d415270a9da85dc4fbdcbd8c3ef378b5087ac8d3a9e0a8456410c076194
3
+ size 13825630528
deepseek-coder-7b-instruct-v1.5.q4_0.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:70fd6d46febb4b08b76d63fd21ca79a65fd663fd4965316aa196e9754e946b55
3
+ size 4000473440
deepseek-coder-7b-instruct-v1.5.q4_k.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:d76948555c437388dbb32efe09ccb1059b154202773ca3e9bbfa412fbab0e22e
3
+ size 4223770976
deepseek-coder-7b-instruct-v1.5.q5_k.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:2f1b5ad7e7c48a83e8717852ab944f0c8580eb91c034ec1353ce6154bcf73c6b
3
+ size 4926841184
deepseek-coder-7b-instruct-v1.5.q6_k.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:286c8d46182e933bf90e4ef4395f21850232a786fa752daa674aafc7bafd899c
3
+ size 5673853280
deepseek-coder-7b-instruct-v1.5.q8_0.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:7836fd479d16ae9beebf38de444de1fb86d257499b3e6dd0bc77c4d1b16eb45c
3
+ size 7347396960