joey00072/Llama_r1_instruct_merge

	---
	license: apache-2.0
	tags:
	- merge
	- mergekit
	- lazymergekit
	- deepseek-ai/DeepSeek-R1-Distill-Llama-8B
	- unsloth/Llama-3.1-8B-Instruct
	---

	# Llama_r1_instruct_merge

	Llama_r1_instruct_merge is a merge of the following models using [mergekit](https://github.com/cg123/mergekit):
	* [deepseek-ai/DeepSeek-R1-Distill-Llama-8B](https://huggingface.co/deepseek-ai/DeepSeek-R1-Distill-Llama-8B)
	* [unsloth/Llama-3.1-8B-Instruct](https://huggingface.co/unsloth/Llama-3.1-8B-Instruct)

	## 🧩 Configuration

	```yaml
	slices:
	  - sources:
	      - model: deepseek-ai/DeepSeek-R1-Distill-Llama-8B
	        layer_range: [0, 32]
	      - model: unsloth/Llama-3.1-8B-Instruct
	        layer_range: [0, 32]
	merge_method: slerp
	base_model: unsloth/Llama-3.1-8B-Instruct
	parameters:
	  t:
	    - filter: self_attn
	      value: [0, 0.5, 0.3, 0.7, 1]
	    - filter: mlp
	      value: [1, 0.5, 0.7, 0.3, 0]
	    - value: 0.5
	dtype: bfloat16

	```
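
To make the `slerp` method and the `t` lists in the configuration above more concrete, here is a minimal, dependency-free sketch. It is not mergekit's code. The helper names `slerp` and `layer_t` are hypothetical, and the way a five-value list is spread over the 32 layers (linear interpolation between evenly spaced anchors) is an assumption about mergekit's behavior. As far as the mergekit documentation describes it, `t = 0` keeps the `base_model` weights (here `unsloth/Llama-3.1-8B-Instruct`) and `t = 1` takes the other model's weights.

```python
import math


def slerp(t, v0, v1, eps=1e-8):
    """Spherical linear interpolation between two equal-length vectors.

    t = 0 returns v0 and t = 1 returns v1. Illustrative only: real
    checkpoints are merged tensor by tensor, not as Python lists.
    """
    norm0 = math.sqrt(sum(x * x for x in v0))
    norm1 = math.sqrt(sum(x * x for x in v1))
    # Cosine of the angle between the two directions, clamped for acos.
    dot = sum(a * b for a, b in zip(v0, v1)) / max(norm0 * norm1, eps)
    dot = max(-1.0, min(1.0, dot))
    theta = math.acos(dot)
    sin_theta = math.sin(theta)
    if sin_theta < 1e-6:
        # Nearly colinear vectors: fall back to plain linear interpolation.
        return [(1 - t) * a + t * b for a, b in zip(v0, v1)]
    w0 = math.sin((1 - t) * theta) / sin_theta
    w1 = math.sin(t * theta) / sin_theta
    return [w0 * a + w1 * b for a, b in zip(v0, v1)]


def layer_t(anchors, layer, num_layers=32):
    """Interpolate a list of anchor values linearly across the layer stack.

    Assumption: the anchors are evenly spaced from the first layer to the
    last one, so layer 0 gets anchors[0] and the final layer gets anchors[-1].
    """
    if len(anchors) == 1 or num_layers == 1:
        return anchors[0]
    pos = layer / (num_layers - 1) * (len(anchors) - 1)
    lo = min(int(pos), len(anchors) - 2)
    frac = pos - lo
    return (1 - frac) * anchors[lo] + frac * anchors[lo + 1]


# Anchor lists copied from the configuration above.
SELF_ATTN_T = [0, 0.5, 0.3, 0.7, 1]
MLP_T = [1, 0.5, 0.7, 0.3, 0]

if __name__ == "__main__":
    for layer in (0, 8, 16, 24, 31):
        print(layer, layer_t(SELF_ATTN_T, layer), layer_t(MLP_T, layer))
```

Under these assumptions the two schedules mirror each other: where the `self_attn` tensors of a layer lean toward one model, the `mlp` tensors of that layer lean toward the other, and every tensor matched by neither filter uses the constant `t = 0.5`.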