license: apache-2.0 | |
EDIT: Base Mistral, but with some minor head trauma. Promising in theory, needs finetuning, but only really outperforms the 14b in size. | |
A 11b Mistral base model, based on the NeverSleep recipe. | |
### Recipe | |
slices | |
- sources: | |
- | |
- model: mistralai/Mistral-7B-v0.1 | |
- | |
layer_range: [0, 24] | |
- sources: | |
- | |
- model: mistralai/Mistral-7B-v0.1 | |
- | |
layer_range: [8, 32] | |
merge_method: passthrough | |
dtype: bfloat16 |