Merge-Llama-3-8B
Merge-Llama-3-8B is a merge of the following models using mergekit:
🧩 Configuration
'''yaml slices:
- sources:
- model: meta-llama/Meta-Llama-3-8B-Instruct layer_range: [0, 32]
- model: MLP-KTLim/llama-3-Korean-Bllossom-8B layer_range: [0, 32] merge_method: slerp base_model: meta-llama/Meta-Llama-3-8B-Instruct parameters: t:
- filter: self_attn value: [0, 0.5, 0.3, 0.7, 1]
- filter: mlp value: [1, 0.5, 0.7, 0.3, 0]
- value: 0.5 dtype: bfloat16 '''
- Downloads last month
- 3