Update README.md
README.md
CHANGED
@@ -8,19 +8,19 @@ tags:
 - not-for-all-audiences
 license: apache-2.0
 ---
-#
+# VICIOUS_MESH-12B-NEMO

 This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit).

 ## Merge Details
 ### Merge Method

-This model was merged using the [TIES](https://arxiv.org/abs/2306.01708) merge method using
+This model was merged using the [TIES](https://arxiv.org/abs/2306.01708) merge method using Mistral-Nemo-Instruct-2407 as a base.

 ### Models Merged

 The following models were included in the merge:
-*
+* VICIOUS_MESH-12B-BETA

 ### Configuration

@@ -28,14 +28,14 @@ The following YAML configuration was used to produce this model:

 ```yaml
 models:
-  - model:
+  - model: Mistral-Nemo-Instruct-2407
     #no parameters necessary for base model
-  - model:
+  - model: VICIOUS_MESH-12B-BETA
     parameters:
       density: 1
       weight: 1
 merge_method: ties
-base_model:
+base_model: Mistral-Nemo-Instruct-2407
 tokenizer_source: base
 chat_template: "auto"
 parameters:
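
For readers unfamiliar with what `merge_method: ties` with `density: 1` and `weight: 1` implies, the following is a minimal sketch of the TIES steps (trim, elect sign, merge agreeing values) on flat lists of floats. It is not mergekit's implementation: the function names `trim` and `ties_merge` are hypothetical, the weighted-mean normalization is an assumption, and real checkpoints are tensors rather than Python lists.

```python
from typing import List, Sequence


def trim(task_vector: Sequence[float], density: float) -> List[float]:
    """Keep the largest-magnitude `density` fraction of entries; zero the rest."""
    k = int(round(density * len(task_vector)))
    if k <= 0:
        return [0.0] * len(task_vector)
    # Indices of the k entries with the largest absolute value.
    order = sorted(range(len(task_vector)),
                   key=lambda i: abs(task_vector[i]), reverse=True)
    keep = set(order[:k])
    return [v if i in keep else 0.0 for i, v in enumerate(task_vector)]


def ties_merge(base: Sequence[float],
               finetuned: Sequence[Sequence[float]],
               densities: Sequence[float],
               weights: Sequence[float]) -> List[float]:
    """Simplified TIES merge (illustrative assumption, not mergekit's code)."""
    # 1. Task vectors: difference between each fine-tuned model and the base,
    #    trimmed to the requested density.
    task_vectors = []
    for ft, density in zip(finetuned, densities):
        delta = [f - b for f, b in zip(ft, base)]
        task_vectors.append(trim(delta, density))

    merged = []
    for i, b in enumerate(base):
        column = [(w, tv[i]) for w, tv in zip(weights, task_vectors)]
        # 2. Elect a sign per parameter from the weighted sum.
        total = sum(w * v for w, v in column)
        sign = 1.0 if total >= 0 else -1.0
        # 3. Average only the entries whose sign agrees with the elected one.
        agreeing = [(w, v) for w, v in column if v * sign > 0]
        if agreeing:
            num = sum(w * v for w, v in agreeing)
            den = sum(w for w, _ in agreeing)
            merged.append(b + num / den)
        else:
            merged.append(b)
    return merged


if __name__ == "__main__":
    base = [1.0, 2.0, 3.0]
    tuned = [1.5, 1.0, 3.0]
    print(ties_merge(base, [tuned], densities=[1.0], weights=[1.0]))
```

Under these assumptions, a single non-base model with `density: 1` and `weight: 1` keeps its whole task vector, so the sketch returns that model's parameters unchanged. Trimming and sign election only change the result when the density is below 1 or when several models disagree.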