Update README.md
Browse files
README.md
CHANGED
@@ -1,13 +1,14 @@
|
|
1 |
-
This is a exllama V2 quantization of https://huggingface.co/Gryphe/MythoMax-L2-13b
|
2 |
-
Uses a target bpw of 8, intended for best quality on cards like a 3090 or similar.
|
3 |
-
Includes measurement.json for convenience of quantizing to other sizes.
|
4 |
-
Calibration data: https://huggingface.co/datasets/wikitext/resolve/refs%2Fconvert%2Fparquet/wikitext-2-v1/test/0000.parquet
|
5 |
-
|
6 |
---
|
7 |
license: other
|
8 |
language:
|
9 |
- en
|
10 |
---
|
|
|
|
|
|
|
|
|
|
|
|
|
11 |
An improved, potentially even perfected variant of MythoMix, my [MythoLogic-L2](https://huggingface.co/Gryphe/MythoLogic-L2-13b) and [Huginn](https://huggingface.co/The-Face-Of-Goonery/Huginn-13b-FP16) merge using a highly experimental tensor type merge technique. The main difference with MythoMix is that I allowed more of Huginn to intermingle with the single tensors located at the front and end of a model, resulting in increased coherency across the entire structure.
|
12 |
|
13 |
The script and the acccompanying templates I used to produce both can [be found here](https://github.com/Gryphe/BlockMerge_Gradient/tree/main/YAML).
|
|
|
|
|
|
|
|
|
|
|
|
|
1 |
---
|
2 |
license: other
|
3 |
language:
|
4 |
- en
|
5 |
---
|
6 |
+
|
7 |
+
This is an exllama V2 quantization of https://huggingface.co/Gryphe/MythoMax-L2-13b
|
8 |
+
Uses a target bpw of 8, intended for best quality on cards like a 3090 or similar.
|
9 |
+
Includes measurement.json for convenience of quantizing to other sizes.
|
10 |
+
Calibration data: https://huggingface.co/datasets/wikitext/resolve/refs%2Fconvert%2Fparquet/wikitext-2-v1/test/0000.parquet
|
11 |
+
|
12 |
An improved, potentially even perfected variant of MythoMix, my [MythoLogic-L2](https://huggingface.co/Gryphe/MythoLogic-L2-13b) and [Huginn](https://huggingface.co/The-Face-Of-Goonery/Huginn-13b-FP16) merge using a highly experimental tensor type merge technique. The main difference with MythoMix is that I allowed more of Huginn to intermingle with the single tensors located at the front and end of a model, resulting in increased coherency across the entire structure.
|
13 |
|
14 |
The script and the acccompanying templates I used to produce both can [be found here](https://github.com/Gryphe/BlockMerge_Gradient/tree/main/YAML).
|