Upload folder using huggingface_hub

Files changed (7) hide show

README.md CHANGED Viewed

@@ -1,7 +1,10 @@
 ---
 base_model:
 - bunnycore/Qwen2.5-7B-MixStock-V0.1
-- bunnycore/Qwen-2.5-7b-rp-lora
 library_name: transformers
 tags:
 - mergekit
@@ -15,25 +18,35 @@ This is a merge of pre-trained language models created using [mergekit](https://
 ## Merge Details
 ### Merge Method
-This model was merged using the Passthrough merge method using [bunnycore/Qwen2.5-7B-MixStock-V0.1](https://huggingface.co/bunnycore/Qwen2.5-7B-MixStock-V0.1) + [bunnycore/Qwen-2.5-7b-rp-lora](https://huggingface.co/bunnycore/Qwen-2.5-7b-rp-lora) as a base.
 ### Models Merged
 The following models were included in the merge:
 ### Configuration
 The following YAML configuration was used to produce this model:
 ```yaml
-base_model: bunnycore/Qwen2.5-7B-MixStock-V0.1+bunnycore/Qwen-2.5-7b-rp-lora
-dtype: bfloat16
-merge_method: passthrough
 models:
-  - model: bunnycore/Qwen2.5-7B-MixStock-V0.1+bunnycore/Qwen-2.5-7b-rp-lora
-tokenizer_source: bunnycore/Qwen2.5-7B-MixStock-V0.1
 ```

 ---
 base_model:
 - bunnycore/Qwen2.5-7B-MixStock-V0.1
+- bunnycore/Qwen2.5-7B-RRP-1M
+- nvidia/AceInstruct-7B
+- open-r1/OpenR1-Qwen-7B
+- open-thoughts/OpenThinker-7B
 library_name: transformers
 tags:
 - mergekit
 ## Merge Details
 ### Merge Method
+This model was merged using the [SCE](https://arxiv.org/abs/2408.07990) merge method using [bunnycore/Qwen2.5-7B-RRP-1M](https://huggingface.co/bunnycore/Qwen2.5-7B-RRP-1M) as a base.
 ### Models Merged
 The following models were included in the merge:
+* [bunnycore/Qwen2.5-7B-MixStock-V0.1](https://huggingface.co/bunnycore/Qwen2.5-7B-MixStock-V0.1)
+* [nvidia/AceInstruct-7B](https://huggingface.co/nvidia/AceInstruct-7B)
+* [open-r1/OpenR1-Qwen-7B](https://huggingface.co/open-r1/OpenR1-Qwen-7B)
+* [open-thoughts/OpenThinker-7B](https://huggingface.co/open-thoughts/OpenThinker-7B)
 ### Configuration
 The following YAML configuration was used to produce this model:
 ```yaml
 models:
+  # Pivot model
+  - model: bunnycore/Qwen2.5-7B-RRP-1M
+  # Target models
+  - model: open-thoughts/OpenThinker-7B
+  - model: open-r1/OpenR1-Qwen-7B
+  - model: bunnycore/Qwen2.5-7B-MixStock-V0.1
+  - model: nvidia/AceInstruct-7B
+merge_method: sce
+base_model: bunnycore/Qwen2.5-7B-RRP-1M
+parameters:
+  select_topk: 0.65
+  int8_mask: true
+dtype: bfloat16
 ```

config.json CHANGED Viewed

@@ -1,5 +1,5 @@
 {
-  "_name_or_path": "bunnycore/Qwen2.5-7B-MixStock-V0.1",
   "architectures": [
     "Qwen2ForCausalLM"
   ],

 {
+  "_name_or_path": "bunnycore/Qwen2.5-7B-RRP-1M",
   "architectures": [
     "Qwen2ForCausalLM"
   ],

mergekit_config.yml CHANGED Viewed

@@ -1,8 +1,15 @@
-base_model: bunnycore/Qwen2.5-7B-MixStock-V0.1+bunnycore/Qwen-2.5-7b-rp-lora
-dtype: bfloat16
-merge_method: passthrough
 models:
-  - model: bunnycore/Qwen2.5-7B-MixStock-V0.1+bunnycore/Qwen-2.5-7b-rp-lora
-tokenizer_source: bunnycore/Qwen2.5-7B-MixStock-V0.1

 models:
+  # Pivot model
+  - model: bunnycore/Qwen2.5-7B-RRP-1M
+  # Target models
+  - model: open-thoughts/OpenThinker-7B
+  - model: open-r1/OpenR1-Qwen-7B
+  - model: bunnycore/Qwen2.5-7B-MixStock-V0.1
+  - model: nvidia/AceInstruct-7B
+merge_method: sce
+base_model: bunnycore/Qwen2.5-7B-RRP-1M
+parameters:
+  select_topk: 0.65
+  int8_mask: true
+dtype: bfloat16

model-00001-of-00004.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:6f34dd956765ecc73ad7a9c6e0e573b0324586e1b65a922ba5cb32efef4b2ba4
 size 4970978712

 version https://git-lfs.github.com/spec/v1
+oid sha256:8c2127c19dc89c055407464e036b94d59fc501f510f54b870c8eeb7af4603a1e
 size 4970978712

model-00002-of-00004.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:14a3be136f3bd474fe77b6b90a825e1910d247ecdd9cdd65b1fa85f983f2cce1
 size 4932751032

 version https://git-lfs.github.com/spec/v1
+oid sha256:299b889e81aaf7adb0cc6a95f35d4b73c8b0a742778124cde8004e2623da3d71
 size 4932751032

model-00003-of-00004.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:5f68b371c26931d3ec8f1a46f98032efde0e8ee61ce20eeb3ac55ab1827bbe2c
 size 4991495808

 version https://git-lfs.github.com/spec/v1
+oid sha256:38c87fd618dbc84bdb83152f44d5a758f64669ff2332385ed5a0f58e6c5e49cf
 size 4991495808

model-00004-of-00004.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:cf531c55de122b931de87a8450514e3f80e1b42adf0b86dc341dec2de6102ddc
 size 330326240

 version https://git-lfs.github.com/spec/v1
+oid sha256:0ab21de1801cf249039b4f14764a6b81f8e4a34cb146f14f073bb3802b4d81a0
 size 330326240