|
---
|
|
base_model:
|
|
- mrfakename/mistral-small-3.1-24b-base-2503-hf
|
|
- Doctor-Shotgun/MS3.2-24B-Magnum-Diamond
|
|
- PocketDoc/Dans-PersonalityEngine-V1.3.0-24b
|
|
library_name: transformers
|
|
tags:
|
|
- mergekit
|
|
- merge
|
|
pipeline_tag: text-generation
|
|
license: apache-2.0
|
|
---
|
|
# Modicum-of-Doubt-v1-24B
|
|
|
|
This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit).
|
|
|
|
The model vision component was excised from all merge contributions.
|
|
|
|
Creative text generation outputs seem to trend toward the short side, sometimes to the point of feeling choppy, hence the model name. This model is not the most stellar, but the result is interesting, going against the individual tendency of the two contributing models toward longer outputs.
|
|
|
|
Tested sampler settings: temperature 1.0, minP 0.02
|
|
|
|
## Merge Details
|
|
### Merge Method
|
|
|
|
This model was merged using the [Task Arithmetic](https://arxiv.org/abs/2212.04089) merge method using [mrfakename/mistral-small-3.1-24b-base-2503-hf](https://huggingface.co/mrfakename/mistral-small-3.1-24b-base-2503-hf) as a base.
|
|
|
|
### Models Merged
|
|
|
|
The following models were included in the merge:
|
|
* [Doctor-Shotgun/MS3.2-24B-Magnum-Diamond](https://huggingface.co/Doctor-Shotgun/MS3.2-24B-Magnum-Diamond)
|
|
* [PocketDoc/Dans-PersonalityEngine-V1.3.0-24b](https://huggingface.co/PocketDoc/Dans-PersonalityEngine-V1.3.0-24b)
|
|
|
|
### Configuration
|
|
|
|
The following YAML configuration was used to produce this model:
|
|
|
|
```yaml
|
|
base_model: mrfakename/mistral-small-3.1-24b-base-2503-hf
|
|
dtype: bfloat16
|
|
merge_method: task_arithmetic
|
|
parameters:
|
|
normalize: true
|
|
models:
|
|
- model: mrfakename/mistral-small-3.1-24b-base-2503-hf
|
|
- model: Doctor-Shotgun/MS3.2-24B-Magnum-Diamond
|
|
parameters:
|
|
weight: 0.5
|
|
- model: PocketDoc/Dans-PersonalityEngine-V1.3.0-24b
|
|
parameters:
|
|
weight: 0.5
|
|
|
|
```
|
|
|