license: apache-2.0 | |
tags: | |
- merge | |
- mergekit | |
- lazymergekit | |
- Qwen/Qwen-7B | |
- rinna/nekomata-7b | |
# without_japanese_apart_from_nekomata-7b_task_arithmetic | |
without_japanese_apart_from_nekomata-7b_task_arithmetic is a merge of the following models using [mergekit](https://github.com/cg123/mergekit): | |
* [Qwen/Qwen-7B](https://huggingface.co/Qwen/Qwen-7B) | |
* [rinna/nekomata-7b](https://huggingface.co/rinna/nekomata-7b) | |
## 🧩 Configuration | |
```yaml | |
models: | |
- model: Qwen/Qwen-7B | |
parameters: | |
weight: 0.5 | |
- model: rinna/nekomata-7b | |
parameters: | |
weight: 0.5 | |
merge_method: task_arithmetic | |
base_model: rinna/nekomata-7b | |
parameters: | |
normalize: true | |
int8_mask: true | |
dtype: bfloat16 | |
``` |