Update README.md
Browse files
README.md
CHANGED
@@ -5,11 +5,15 @@ library_name: transformers
|
|
5 |
tags:
|
6 |
- mergekit
|
7 |
- merge
|
8 |
-
|
|
|
|
|
9 |
---
|
10 |
-
#
|
|
|
|
|
11 |
|
12 |
-
|
13 |
|
14 |
## Merge Details
|
15 |
### Merge Method
|
@@ -35,4 +39,4 @@ slices:
|
|
35 |
- sources:
|
36 |
- layer_range: [31, 32]
|
37 |
model: microsoft/Phi-3-small-8k-instruct
|
38 |
-
```
|
|
|
5 |
tags:
|
6 |
- mergekit
|
7 |
- merge
|
8 |
+
license: mit
|
9 |
+
language:
|
10 |
+
- en
|
11 |
---
|
12 |
+
# Phi-3-small-8k-instruct: 6 layers prunted
|
13 |
+
|
14 |
+
This is a layer-pruned language model created using [mergekit](https://github.com/cg123/mergekit). Layers to prune were selected based off of the average distances as follows:
|
15 |
|
16 |
+

|
17 |
|
18 |
## Merge Details
|
19 |
### Merge Method
|
|
|
39 |
- sources:
|
40 |
- layer_range: [31, 32]
|
41 |
model: microsoft/Phi-3-small-8k-instruct
|
42 |
+
```
|