ddh0
/

Qwen2.5-72B-0.6x-Instruct

Text Generation

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

ddh0 commited on Nov 16, 2024

Commit

30ab6db

·

verified ·

1 Parent(s): c64d81b

Update README.md

Files changed (1) hide show

README.md +12 -34

README.md CHANGED Viewed

@@ -1,39 +1,17 @@
 ---
-base_model: []
-library_name: transformers
 tags:
-- mergekit
-- merge
 ---
-# Untitled Model (1)
-This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit).
-## Merge Details
-### Merge Method
-This model was merged using the [linear](https://arxiv.org/abs/2203.05482) merge method.
-### Models Merged
-The following models were included in the merge:
-* ./Qwen2.5-72B-Instruct/
-* ./Qwen2.5-72B/
-### Configuration
-The following YAML configuration was used to produce this model:
-```yaml
-models:
-  - model: ./Qwen2.5-72B/
-    parameters:
-      weight: 0.4
-  - model: ./Qwen2.5-72B-Instruct/
-    parameters:
-      weight: 0.6
-merge_method: linear
-dtype: bfloat16
-```

 ---
+license: other
+license_name: qwen
+license_link: https://huggingface.co/Qwen/Qwen2.5-72B-Instruct/blob/main/LICENSE
+language:
+- en
+pipeline_tag: text-generation
+base_model: Qwen/Qwen2.5-72B
 tags:
+- chat
+library_name: transformers
 ---
+# Qwen2.5-72B-0.6x-Instruct
+This is a linear merge of [Qwen/Qwen2.5-72B-Instruct](https://huggingface.co/Qwen/Qwen2.5-72B-Instruct) at weight `0.6` and [Qwen/Qwen2.5-72B](https://huggingface.co/Qwen/Qwen2.5-72B) at weight `0.4`.
+The resulting model is 60% Instruct and 40% base model, hence the name **`0.6x-Instruct`**.