Update README.md
Browse files
README.md
CHANGED
@@ -10,6 +10,9 @@ This is Starling-LM-10.7B-beta, a depth-upscaled version of [Nexusflow/Starling-
|
|
10 |
|
11 |
This model is intended to be used as a drop-in upgrade from the original 7 billion parameter model.
|
12 |
|
|
|
|
|
|
|
13 |
# ExLlamaV2 quantizations (courtesy of [blockblockblock](https://huggingface.co/blockblockblock))
|
14 |
- [2.5 bpw](https://huggingface.co/blockblockblock/Starling-LM-10.7B-beta-bpw2.5)
|
15 |
- [3 bpw](https://huggingface.co/blockblockblock/Starling-LM-10.7B-beta-bpw3)
|
|
|
10 |
|
11 |
This model is intended to be used as a drop-in upgrade from the original 7 billion parameter model.
|
12 |
|
13 |
+
# GGUF quantizations (courtesy of bartowski)
|
14 |
+
See [bartowski/Starling-LM-10.7B-beta-GGUF](https://huggingface.co/bartowski/Starling-LM-10.7B-beta-GGUF)
|
15 |
+
|
16 |
# ExLlamaV2 quantizations (courtesy of [blockblockblock](https://huggingface.co/blockblockblock))
|
17 |
- [2.5 bpw](https://huggingface.co/blockblockblock/Starling-LM-10.7B-beta-bpw2.5)
|
18 |
- [3 bpw](https://huggingface.co/blockblockblock/Starling-LM-10.7B-beta-bpw3)
|