paloalma
/

ECE-TW3-JRGL-V1

Text Generation

ShinojiResearch/Senku-70B-Full

152334H/miqu-1-70b-sf

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

paloalma commited on Apr 19, 2024

Commit

f1916d0

•

1 Parent(s): 58e545c

Update README.md

Files changed (1) hide show

README.md +19 -0

README.md CHANGED Viewed

@@ -31,6 +31,25 @@ ECE-TW3-JRGL-V1 is a merge of the following models using **[mergekit](https://g
 * [ShinojiResearch/Senku-70B-Full](https://huggingface.co/ShinojiResearch/Senku-70B-Full)
 * [152334H/miqu-1-70b-sf](https://huggingface.co/152334H/miqu-1-70b-sf)
 ## Results
 - ECE-TW3-JRGL-v1 scores 83.07 on [EQ-Bench V2](https://eqbench.com/index.html)

 * [ShinojiResearch/Senku-70B-Full](https://huggingface.co/ShinojiResearch/Senku-70B-Full)
 * [152334H/miqu-1-70b-sf](https://huggingface.co/152334H/miqu-1-70b-sf)
+```yaml
+slices:
+  - sources:
+      - model: ShinojiResearch/Senku-70B-Full
+        layer_range: [0, 80]
+      - model: 152334H/miqu-1-70b-sf
+        layer_range: [0, 80]
+merge_method: slerp
+base_model: 152334H/miqu-1-70b-sf
+parameters:
+  t:
+    - filter: self_attn
+      value: [0, 0.5, 0.3, 0.7, 1]
+    - filter: mlp
+      value: [1, 0.5, 0.7, 0.3, 0]
+    - value: 0.5
+dtype: float16
+```
 ## Results
 - ECE-TW3-JRGL-v1 scores 83.07 on [EQ-Bench V2](https://eqbench.com/index.html)