QMB15
/

Stheno-L2-13B-8bit-exl2

Text Generation

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

QMB15 commited on Sep 15, 2023

Commit

c6f2e51

•

1 Parent(s): 4b9924d

Update README.md

Files changed (1) hide show

README.md +6 -6

README.md CHANGED Viewed

@@ -1,3 +1,9 @@
 This is a exllama V2 quantization of https://huggingface.co/TheBloke/Stheno-L2-13B-GPTQ
 Uses a target bpw of 8, intended for best quality on cards like a 3090 or similar.
 Includes measurement.json for convenience of quantizing to other sizes.
@@ -5,12 +11,6 @@ Calibration data: https://huggingface.co/datasets/wikitext/resolve/refs%2Fconver
----
-license: llama2
-language:
-- en
----
 <img src="https://w.forfun.com/fetch/cb/cba2205390e517bea1ea60ca0b491af4.jpeg"  style="width: 70%; min-width: 300px; display: block; margin: auto;">
 An experimental merging of Several Models using two various methods, [Ties-Merge](https://github.com/cg123/ties-merge) and [BlockMerge_Gradient](https://github.com/Gryphe/BlockMerge_Gradient)

+---
+license: llama2
+language:
+- en
+---
 This is a exllama V2 quantization of https://huggingface.co/TheBloke/Stheno-L2-13B-GPTQ
 Uses a target bpw of 8, intended for best quality on cards like a 3090 or similar.
 Includes measurement.json for convenience of quantizing to other sizes.
 <img src="https://w.forfun.com/fetch/cb/cba2205390e517bea1ea60ca0b491af4.jpeg"  style="width: 70%; min-width: 300px; display: block; margin: auto;">
 An experimental merging of Several Models using two various methods, [Ties-Merge](https://github.com/cg123/ties-merge) and [BlockMerge_Gradient](https://github.com/Gryphe/BlockMerge_Gradient)