# llama3.1-8b-swallow-us4fin-dare_ties-d5w5_d5w5
This is a merge of pre-trained language models created using mergekit.
## Merge Details

### Merge Method

This model was merged using the DARE TIES merge method, with meta-llama/Llama-3.1-8B-Instruct as the base model.
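In brief, DARE TIES operates on each model's task vector (its parameter delta from the base): DARE randomly drops a fraction of each delta's entries and rescales the survivors (the `density` parameter in the configuration below keeps roughly half of them), and TIES then elects a per-parameter majority sign and discards contributions that disagree before summing. The sketch below illustrates the idea on a single tensor; it is a simplified sketch, not mergekit's implementation, and the function name is hypothetical.

```python
import torch

def dare_ties_merge(base, task_tensors, density=0.5, weights=(0.5, 0.5)):
    """Single-tensor sketch of DARE TIES (illustrative only)."""
    sparse = []
    for tensor, w in zip(task_tensors, weights):
        delta = tensor - base                              # task vector
        keep = torch.bernoulli(torch.full_like(delta, density))
        sparse.append(w * delta * keep / density)          # DARE: drop + rescale
    stacked = torch.stack(sparse)
    elected = stacked.sum(dim=0).sign()                    # TIES: majority sign
    agree = (stacked.sign() == elected).float()
    return base + (stacked * agree).sum(dim=0)             # sum agreeing deltas
```

With `normalize: true`, mergekit additionally rescales the merged deltas by the participating weights, which this sketch omits for brevity.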
### Models Merged

The following models were included in the merge:

* tokyotech-llm/Llama-3.1-Swallow-8B-Instruct-v0.3
* us4/fin-llama3.1-8b
### Configuration
The following YAML configuration was used to produce this model:
```yaml
models:
  - model: tokyotech-llm/Llama-3.1-Swallow-8B-Instruct-v0.3
    parameters:
      density: 0.5
      weight: 0.5
  - model: us4/fin-llama3.1-8b
    parameters:
      density: 0.5
      weight: 0.5
merge_method: dare_ties
base_model: meta-llama/Llama-3.1-8B-Instruct
parameters:
  normalize: true
dtype: float16
```
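To reproduce the merge itself, this YAML can be passed to mergekit's `mergekit-yaml` CLI (for example, `mergekit-yaml config.yaml ./output-model`). Once merged, the model loads like any other Llama 3.1 checkpoint. A minimal sketch, assuming the repo id from this card and a standard `transformers` install:

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

# Repo id taken from this card; adjust if the weights live elsewhere.
model_id = "rsh345/llama3.1-8b-swallow-us4fin-dare_ties-d5w5_d5w5"

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    torch_dtype=torch.float16,  # matches the merge dtype above
    device_map="auto",
)

messages = [{"role": "user", "content": "Summarize what a DARE TIES merge does."}]
input_ids = tokenizer.apply_chat_template(
    messages, add_generation_prompt=True, return_tensors="pt"
).to(model.device)
output = model.generate(input_ids, max_new_tokens=128)
print(tokenizer.decode(output[0][input_ids.shape[-1]:], skip_special_tokens=True))
```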