intervitens committed daa1e59
Parent: 387fa84
Update README.md
README.md CHANGED
@@ -8,6 +8,18 @@ language:
 - en
 inference: false
 ---
+
+
+Quantized using samples of 8192 tokens from the default ExllamaV2 dataset.
+
+Requires ExllamaV2 version 0.0.11 and up.
+
+Original model link: [Mixtral-8x7B-Instruct-v0.1](https://huggingface.co/mistralai/Mixtral-8x7B-Instruct-v0.1)
+
+Original model README below.
+
+***
+
 # Model Card for Mixtral-8x7B
 The Mixtral-8x7B Large Language Model (LLM) is a pretrained generative Sparse Mixture of Experts. The Mixtral-8x7B outperforms Llama 2 70B on most benchmarks we tested.
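For context on the "Requires ExllamaV2 version 0.0.11 and up" note above, here is a minimal loading sketch patterned on ExllamaV2's bundled inference example. The model path, prompt, and sampler settings are placeholders, and the exact calls may vary slightly between 0.0.x releases:

```python
# Minimal sketch: load an EXL2 quant with ExllamaV2 (>= 0.0.11) and generate.
# Paths and prompt are hypothetical; API follows the library's example script.
from exllamav2 import ExLlamaV2, ExLlamaV2Config, ExLlamaV2Cache, ExLlamaV2Tokenizer
from exllamav2.generator import ExLlamaV2BaseGenerator, ExLlamaV2Sampler

config = ExLlamaV2Config()
config.model_dir = "/path/to/Mixtral-8x7B-Instruct-v0.1-exl2"  # local copy of this quant
config.prepare()

model = ExLlamaV2(config)
cache = ExLlamaV2Cache(model, lazy=True)  # allocate cache first, load weights lazily
model.load_autosplit(cache)               # split layers across available GPUs
tokenizer = ExLlamaV2Tokenizer(config)

generator = ExLlamaV2BaseGenerator(model, cache, tokenizer)
settings = ExLlamaV2Sampler.Settings()
settings.temperature = 0.7

# Mixtral-Instruct uses the [INST] ... [/INST] prompt format.
prompt = "[INST] Explain mixture-of-experts routing in one paragraph. [/INST]"
print(generator.generate_simple(prompt, settings, num_tokens=200))
```

With `lazy=True` plus `load_autosplit`, the weights are distributed across GPUs as the cache fills, which is the usual way to fit a large MoE quant that exceeds a single card's VRAM.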