metadata
base_model:
- mistralai/Mixtral-8x7B-Instruct-v0.1
- NousResearch/Nous-Hermes-2-Mixtral-8x7B-DPO
tags:
- mergekit
- merge
lesser-hermes
This is a merge of pre-trained language models created using mergekit. We've been using this as one of the experimental ingredients to help stabilize the monkey-typewriter merges, and it's kinda okay at that.
Merge Details
Merge Method
This model was merged using the DARE TIES merge method using mistralai/Mixtral-8x7B-Instruct-v0.1 as a base.
Models Merged
The following models were included in the merge:
Configuration
The following YAML configuration was used to produce this model:
models:
# dont bagel me bro
- model: NousResearch/Nous-Hermes-2-Mixtral-8x7B-DPO
parameters:
density: 0.25
weight: 0.3
- model: mistralai/Mixtral-8x7B-Instruct-v0.1
parameters:
density: 0.5
weight: 1
merge_method: dare_ties
base_model: mistralai/Mixtral-8x7B-Instruct-v0.1
parameters:
#normalize: false
#int8_mask: true
dtype: bfloat16