RichardErkhov commited on
Commit
1006ae1
·
verified ·
1 Parent(s): e11dd37

uploaded readme

Browse files
Files changed (1) hide show
  1. README.md +64 -0
README.md ADDED
@@ -0,0 +1,64 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ Quantization made by Richard Erkhov.
2
+
3
+ [Github](https://github.com/RichardErkhov)
4
+
5
+ [Discord](https://discord.gg/pvy7H8DZMG)
6
+
7
+ [Request more models](https://github.com/RichardErkhov/quant_request)
8
+
9
+
10
+ Mistral-7B-v0.1 - bnb 4bits
11
+ - Model creator: https://huggingface.co/mistralai/
12
+ - Original model: https://huggingface.co/mistralai/Mistral-7B-v0.1/
13
+
14
+
15
+
16
+
17
+ Original model description:
18
+ ---
19
+ license: apache-2.0
20
+ pipeline_tag: text-generation
21
+ language:
22
+ - en
23
+ tags:
24
+ - pretrained
25
+ inference:
26
+ parameters:
27
+ temperature: 0.7
28
+ ---
29
+
30
+ # Model Card for Mistral-7B-v0.1
31
+
32
+ The Mistral-7B-v0.1 Large Language Model (LLM) is a pretrained generative text model with 7 billion parameters.
33
+ Mistral-7B-v0.1 outperforms Llama 2 13B on all benchmarks we tested.
34
+
35
+ For full details of this model please read our [paper](https://arxiv.org/abs/2310.06825) and [release blog post](https://mistral.ai/news/announcing-mistral-7b/).
36
+
37
+ ## Model Architecture
38
+
39
+ Mistral-7B-v0.1 is a transformer model, with the following architecture choices:
40
+ - Grouped-Query Attention
41
+ - Sliding-Window Attention
42
+ - Byte-fallback BPE tokenizer
43
+
44
+ ## Troubleshooting
45
+
46
+ - If you see the following error:
47
+ ```
48
+ KeyError: 'mistral'
49
+ ```
50
+ - Or:
51
+ ```
52
+ NotImplementedError: Cannot copy out of meta tensor; no data!
53
+ ```
54
+
55
+ Ensure you are utilizing a stable version of Transformers, 4.34.0 or newer.
56
+
57
+ ## Notice
58
+
59
+ Mistral 7B is a pretrained base model and therefore does not have any moderation mechanisms.
60
+
61
+ ## The Mistral AI Team
62
+
63
+ Albert Jiang, Alexandre Sablayrolles, Arthur Mensch, Chris Bamford, Devendra Singh Chaplot, Diego de las Casas, Florian Bressand, Gianna Lengyel, Guillaume Lample, Lélio Renard Lavaud, Lucile Saulnier, Marie-Anne Lachaux, Pierre Stock, Teven Le Scao, Thibaut Lavril, Thomas Wang, Timothée Lacroix, William El Sayed.
64
+