softwareweaver
/

Twilight-XL-2-195B-Mistral-Large-2411-Behemoth

Text Generation

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

softwareweaver commited on Nov 25, 2024

Commit

cc3e5a2

·

verified ·

1 Parent(s): da758c9

Update README.md

Files changed (1) hide show

README.md +2 -20

README.md CHANGED Viewed

@@ -10,12 +10,9 @@ tags:
 ---
 # Twilight-XL-2
-This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit).
-## Merge Details
-### Merge Method
-This model was merged using the passthrough merge method.
 ### Models Merged
@@ -23,18 +20,3 @@ The following models were included in the merge:
 * [TheDrummer/Behemoth-123B-v2.2](https://huggingface.co/TheDrummer/Behemoth-123B-v2.2)
 * [mistralai/Mistral-Large-Instruct-2411](https://huggingface.co/mistralai/Mistral-Large-Instruct-2411)
-### Configuration
-The following YAML configuration was used to produce this model:
-```yaml
-dtype: bfloat16
-merge_method: passthrough
-slices:
-- sources:
-  - layer_range: [0, 70]
-    model: mistralai/Mistral-Large-Instruct-2411
-- sources:
-  - layer_range: [18, 88]
-    model: TheDrummer/Behemoth-123B-v2.2
-```

 ---
 # Twilight-XL-2
+Awesome model for creative story writing with 130K context.
+Created using [mergekit](https://github.com/cg123/mergekit) by @softwareweaver. Use the prompt format that Mistral Large uses.
 ### Models Merged
 * [TheDrummer/Behemoth-123B-v2.2](https://huggingface.co/TheDrummer/Behemoth-123B-v2.2)
 * [mistralai/Mistral-Large-Instruct-2411](https://huggingface.co/mistralai/Mistral-Large-Instruct-2411)