Update README.md
<p><h1> speechless-mistral-moloras-7b </h1></p>

* [AWQ model(s) for GPU inference.](https://huggingface.co/TheBloke/speechless-mistral-moloras-7B-AWQ)
* [GPTQ models for GPU inference, with multiple quantisation parameter options.](https://huggingface.co/TheBloke/speechless-mistral-moloras-7B-GPTQ)
* [2, 3, 4, 5, 6 and 8-bit GGUF models for CPU+GPU inference](https://huggingface.co/TheBloke/speechless-mistral-moloras-7B-GGUF)
* [4-bit GGUF models for CPU+GPU inference](https://huggingface.co/uukuguy/speechless-mistral-moloras-7b/tree/main/GGUF)
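
As a rough guide to what the bit widths in the GGUF list above mean for download size, weight storage is approximately `parameters × bits / 8` bytes. The sketch below illustrates this for a 7B-parameter model; it is a back-of-the-envelope estimate only, since real GGUF files mix quantization types per tensor and carry extra metadata, so actual file sizes will differ:

```python
# Rough GGUF download-size estimate: bytes ≈ parameters * bits_per_weight / 8.
# Ignores per-block scale factors and file metadata, so real files are larger.
PARAMS = 7_000_000_000  # approximate parameter count of a Mistral-7B model


def approx_size_gb(bits: int, params: int = PARAMS) -> float:
    """Approximate weight storage in gigabytes for a given quantization width."""
    return params * bits / 8 / 1e9


for bits in (2, 3, 4, 5, 6, 8):
    print(f"{bits}-bit: ~{approx_size_gb(bits):.2f} GB")
```

For example, dropping from 8-bit to 4-bit quantization roughly halves the download, at some cost in output quality.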

This model is the static version of moloras (Mixture-of-multi-LoRAs), based on the following 6 Mistral-based LoRA modules.