Update README.md
README.md CHANGED

````diff
@@ -1,9 +1,19 @@
 ---
-license:
+license: cc
 language:
 - en
 ---
 
+# Update 2023-12-19
+
+In light of the [dataset contamination issue among the merged models](https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard/discussions/474)
+raised by the community in recent days, in particular
+[berkeley-nest/Starling-LM-7B-alpha](https://huggingface.co/berkeley-nest/Starling-LM-7B-alpha) and
+[Q-bert/MetaMath-Cybertron-Starling](https://huggingface.co/Q-bert/MetaMath-Cybertron-Starling),
+we decided to remake the merge without those models.
+Additionally, their CC-BY-NC-4.0 license is restrictive and thus not suitable for an open model.
+
+
 # Model Description
 This is an experiment to test merging 14 models using DARE TIES 🦙
 
@@ -90,9 +100,9 @@ models:
     weight: 0.08
     density: 0.5
 merge_method: dare_ties
-base_model: /
+base_model: mistralai/Mistral-7B-v0.1
 parameters:
   int8_mask: true
 dtype: bfloat16
 
-```
+```
````