softwareweaver
commited on
Update README.md
Browse files
README.md
CHANGED
@@ -10,12 +10,9 @@ tags:
|
|
10 |
---
|
11 |
# Twilight-XL-2
|
12 |
|
13 |
-
|
14 |
|
15 |
-
|
16 |
-
### Merge Method
|
17 |
-
|
18 |
-
This model was merged using the passthrough merge method.
|
19 |
|
20 |
### Models Merged
|
21 |
|
@@ -23,18 +20,3 @@ The following models were included in the merge:
|
|
23 |
* [TheDrummer/Behemoth-123B-v2.2](https://huggingface.co/TheDrummer/Behemoth-123B-v2.2)
|
24 |
* [mistralai/Mistral-Large-Instruct-2411](https://huggingface.co/mistralai/Mistral-Large-Instruct-2411)
|
25 |
|
26 |
-
### Configuration
|
27 |
-
|
28 |
-
The following YAML configuration was used to produce this model:
|
29 |
-
|
30 |
-
```yaml
|
31 |
-
dtype: bfloat16
|
32 |
-
merge_method: passthrough
|
33 |
-
slices:
|
34 |
-
- sources:
|
35 |
-
- layer_range: [0, 70]
|
36 |
-
model: mistralai/Mistral-Large-Instruct-2411
|
37 |
-
- sources:
|
38 |
-
- layer_range: [18, 88]
|
39 |
-
model: TheDrummer/Behemoth-123B-v2.2
|
40 |
-
```
|
|
|
10 |
---
|
11 |
# Twilight-XL-2
|
12 |
|
13 |
+
Awesome model for creative story writing with 130K context.
|
14 |
|
15 |
+
Created using [mergekit](https://github.com/cg123/mergekit) by @softwareweaver. Use the prompt format that Mistral Large uses.
|
|
|
|
|
|
|
16 |
|
17 |
### Models Merged
|
18 |
|
|
|
20 |
* [TheDrummer/Behemoth-123B-v2.2](https://huggingface.co/TheDrummer/Behemoth-123B-v2.2)
|
21 |
* [mistralai/Mistral-Large-Instruct-2411](https://huggingface.co/mistralai/Mistral-Large-Instruct-2411)
|
22 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|