Merge2-Llama-3.1-8B / README.md
yelim24's picture
Upload folder using huggingface_hub
a5f2f16 verified
metadata
license: apache-2.0
tags:
  - merge
  - mergekit
  - lazymergekit
  - NCSOFT/Llama-VARCO-8B-Instruct
  - sh2orc/Llama-3.1-Korean-8B-Instruct

Merge2-Llama-3.1-8B

Merge2-Llama-3.1-8B is a merge of the following models using mergekit:

🧩 Configuration

'''yaml slices:

  • sources:
    • model: NCSOFT/Llama-VARCO-8B-Instruct layer_range: [0, 32]
    • model: sh2orc/Llama-3.1-Korean-8B-Instruct layer_range: [0, 32] merge_method: slerp base_model: NCSOFT/Llama-VARCO-8B-Instruct parameters: t:
    • filter: self_attn value: [0, 0.5, 0.3, 0.7, 1]
    • filter: mlp value: [1, 0.5, 0.7, 0.3, 0]
    • value: 0.5 dtype: bfloat16 '''