---
base_model:
- NousResearch/Yarn-Mistral-7b-128k
- Test157t/Kunocchini-1.1-7b
library_name: transformers
tags:
- mistral
- quantized
- text-generation-inference
- merge
- mergekit
pipeline_tag: text-generation
inference: false
---

# Quantizing and uploading...

# **GGUF-Imatrix quantizations for [Test157t/Kunocchini-1.2-7b-longtext](https://huggingface.co/Test157t/Kunocchini-1.2-7b-longtext/).**

SillyTavern preset files for the previous version are located [here](https://huggingface.co/Test157t/Kunocchini-7b-128k-test/tree/main/ST%20presets).

*If you want any specific quantization to be added, feel free to ask.*

All credit goes to the [creator](https://huggingface.co/Test157t/).

`Base⇢ GGUF(F16)⇢ Imatrix(F16)⇢ GGUF-Imatrix(Quants)`
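
As a rough illustration of that pipeline, here is a minimal sketch using the llama.cpp tools from the release noted below. It assumes the binaries are built in the working directory; the local paths and the calibration text are placeholders, not the exact inputs used for this repo.

```python
# Minimal sketch of the Base -> GGUF(F16) -> Imatrix -> GGUF-Imatrix pipeline.
# Assumes llama.cpp (b2254) is built in the working directory; all paths and
# the calibration text are placeholders, not the exact inputs used here.
import subprocess

# 1) Convert the HF checkpoint (a local directory) to an F16 GGUF.
subprocess.run([
    "python", "convert.py", "path/to/Kunocchini-1.2-7b-longtext",
    "--outtype", "f16",
    "--outfile", "Kunocchini-1.2-7b-longtext-F16.gguf",
], check=True)

# 2) Compute the importance matrix from the F16 GGUF and a calibration text.
subprocess.run([
    "./imatrix",
    "-m", "Kunocchini-1.2-7b-longtext-F16.gguf",
    "-f", "calibration.txt",
    "-o", "imatrix-Kunocchini-1.2-7b-longtext-F16.dat",
], check=True)

# 3) Produce an imatrix-aware quant (IQ3_S shown as an example).
subprocess.run([
    "./quantize",
    "--imatrix", "imatrix-Kunocchini-1.2-7b-longtext-F16.dat",
    "Kunocchini-1.2-7b-longtext-F16.gguf",
    "Kunocchini-1.2-7b-longtext-IQ3_S.gguf",
    "IQ3_S",
], check=True)
```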

The newly merged **IQ3_S** quant has shown better quality than the old Q3_K_S, but it is only supported in `koboldcpp-1.60` or newer.

Quantized using [llama.cpp](https://github.com/ggerganov/llama.cpp/)-[b2254](https://github.com/ggerganov/llama.cpp/releases/tag/b2254).

For the `--imatrix` data, `imatrix-Kunocchini-1.2-7b-longtext-F16.dat` was used.
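
To use a downloaded quant, something like [llama-cpp-python](https://github.com/abetlen/llama-cpp-python) works. A minimal sketch, where the file name stands in for whichever quant you grabbed:

```python
# Minimal sketch: load a GGUF quant with llama-cpp-python and run one prompt.
# The model path is a placeholder for whichever quant file you downloaded.
from llama_cpp import Llama

llm = Llama(
    model_path="Kunocchini-1.2-7b-longtext-Q4_K_M.gguf",
    n_ctx=8192,  # raise to take advantage of the long-context merge
)

out = llm("Write a one-line greeting.", max_tokens=32)
print(out["choices"][0]["text"])
```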

# Original model information:

Thanks to @Epiculous for the dope model, the help with LLM backends, and the support overall.

I'd also like to thank @kalomaze for the dope sampler additions to ST.

Thank you very much, @SanjiWatsuki, for the help and the model!
![image/jpeg](https://cdn-uploads.huggingface.co/production/uploads/642265bc01c62c1e4102dc36/1M16DsWk39CtFz2SjmYGr.jpeg)

This model was merged using the [DARE](https://arxiv.org/abs/2311.03099) [TIES](https://arxiv.org/abs/2306.01708) merge method.

### Models Merged

The following models were included in the merge:
* [NousResearch/Yarn-Mistral-7b-128k](https://huggingface.co/NousResearch/Yarn-Mistral-7b-128k)
* [Test157t/Kunocchini-1.1-7b](https://huggingface.co/Test157t/Kunocchini-1.1-7b)

### Configuration

The following YAML configuration was used to produce this model:

```yaml
merge_method: dare_ties
base_model: Test157t/Kunocchini-1.1-7b
parameters:
  normalize: true
models:
  - model: NousResearch/Yarn-Mistral-7b-128k
    parameters:
      weight: 1
  - model: Test157t/Kunocchini-1.1-7b
    parameters:
      weight: 1
dtype: float16
```
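
For anyone wanting to reproduce the merge, the config above can be fed to mergekit's `mergekit-yaml` entry point. A minimal sketch, where the config path and output directory are placeholders:

```python
# Minimal sketch: run the merge config above via mergekit's CLI entry point
# (pip install mergekit). Config path and output directory are placeholders.
import subprocess

subprocess.run(
    ["mergekit-yaml", "config.yaml", "./Kunocchini-merge-output"],
    check=True,
)
```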