Upload folder using huggingface_hub
Browse files
README.md
CHANGED
@@ -5,14 +5,14 @@ tags:
|
|
5 |
- mergekit
|
6 |
- lazymergekit
|
7 |
- deepseek-ai/DeepSeek-R1-Distill-Llama-8B
|
8 |
-
-
|
9 |
---
|
10 |
|
11 |
# deepseek-ai/DeepSeek-R1-Distill-Llama-8B
|
12 |
|
13 |
deepseek-ai/DeepSeek-R1-Distill-Llama-8B is a merge of the following models using [mergekit](https://github.com/cg123/mergekit):
|
14 |
* [deepseek-ai/DeepSeek-R1-Distill-Llama-8B](https://huggingface.co/deepseek-ai/DeepSeek-R1-Distill-Llama-8B)
|
15 |
-
* [
|
16 |
|
17 |
## 🧩 Configuration
|
18 |
|
@@ -21,7 +21,7 @@ slices:
|
|
21 |
- sources:
|
22 |
- model: deepseek-ai/DeepSeek-R1-Distill-Llama-8B
|
23 |
layer_range: [0, 32]
|
24 |
-
- model:
|
25 |
layer_range: [0, 32]
|
26 |
merge_method: slerp
|
27 |
base_model: unsloth/Llama-3.1-8B-Instruct
|
|
|
5 |
- mergekit
|
6 |
- lazymergekit
|
7 |
- deepseek-ai/DeepSeek-R1-Distill-Llama-8B
|
8 |
+
- unsloth/Llama-3.1-8B-Instruct
|
9 |
---
|
10 |
|
11 |
# deepseek-ai/DeepSeek-R1-Distill-Llama-8B
|
12 |
|
13 |
deepseek-ai/DeepSeek-R1-Distill-Llama-8B is a merge of the following models using [mergekit](https://github.com/cg123/mergekit):
|
14 |
* [deepseek-ai/DeepSeek-R1-Distill-Llama-8B](https://huggingface.co/deepseek-ai/DeepSeek-R1-Distill-Llama-8B)
|
15 |
+
* [unsloth/Llama-3.1-8B-Instruct](https://huggingface.co/unsloth/Llama-3.1-8B-Instruct)
|
16 |
|
17 |
## 🧩 Configuration
|
18 |
|
|
|
21 |
- sources:
|
22 |
- model: deepseek-ai/DeepSeek-R1-Distill-Llama-8B
|
23 |
layer_range: [0, 32]
|
24 |
+
- model: unsloth/Llama-3.1-8B-Instruct
|
25 |
layer_range: [0, 32]
|
26 |
merge_method: slerp
|
27 |
base_model: unsloth/Llama-3.1-8B-Instruct
|