ddh0 commited on
Commit
30ab6db
·
verified ·
1 Parent(s): c64d81b

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +12 -34
README.md CHANGED
@@ -1,39 +1,17 @@
1
  ---
2
- base_model: []
3
- library_name: transformers
 
 
 
 
 
4
  tags:
5
- - mergekit
6
- - merge
7
-
8
  ---
9
- # Untitled Model (1)
10
-
11
- This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit).
12
-
13
- ## Merge Details
14
- ### Merge Method
15
-
16
- This model was merged using the [linear](https://arxiv.org/abs/2203.05482) merge method.
17
-
18
- ### Models Merged
19
-
20
- The following models were included in the merge:
21
- * ./Qwen2.5-72B-Instruct/
22
- * ./Qwen2.5-72B/
23
-
24
- ### Configuration
25
-
26
- The following YAML configuration was used to produce this model:
27
 
28
- ```yaml
29
- models:
30
- - model: ./Qwen2.5-72B/
31
- parameters:
32
- weight: 0.4
33
- - model: ./Qwen2.5-72B-Instruct/
34
- parameters:
35
- weight: 0.6
36
- merge_method: linear
37
- dtype: bfloat16
38
 
39
- ```
 
1
  ---
2
+ license: other
3
+ license_name: qwen
4
+ license_link: https://huggingface.co/Qwen/Qwen2.5-72B-Instruct/blob/main/LICENSE
5
+ language:
6
+ - en
7
+ pipeline_tag: text-generation
8
+ base_model: Qwen/Qwen2.5-72B
9
  tags:
10
+ - chat
11
+ library_name: transformers
 
12
  ---
13
+ # Qwen2.5-72B-0.6x-Instruct
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
14
 
15
+ This is a linear merge of [Qwen/Qwen2.5-72B-Instruct](https://huggingface.co/Qwen/Qwen2.5-72B-Instruct) at weight `0.6` and [Qwen/Qwen2.5-72B](https://huggingface.co/Qwen/Qwen2.5-72B) at weight `0.4`.
 
 
 
 
 
 
 
 
 
16
 
17
+ The resulting model is 60% Instruct and 40% base model, hence the name **`0.6x-Instruct`**.