huihui-ai commited on
Commit
4c6652c
·
verified ·
1 Parent(s): 8f44bf1

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +5 -7
README.md CHANGED
@@ -14,10 +14,12 @@ language:
14
  # DeepSeek-R1-Distill-Qwen-Coder-32B-Fusion-9010
15
 
16
  ## Overview
17
- `DeepSeek-R1-Distill-Qwen-Coder-32B-Fusion-9010` is a mixed model that combines the strengths of two powerful DeepSeek-R1-Distill-Qwen-based models: [huihui-ai/DeepSeek-R1-Distill-Qwen-32B-abliterated](https://huggingface.co/huihui-ai/DeepSeek-R1-Distill-Qwen-32B-abliterated) and [huihui-ai/Qwen2.5-Coder-32B-Instruct-abliterated](https://huggingface.co/huihui-ai/Qwen2.5-Coder-32B-Instruct-abliterated). The weights are blended in a 9:1 ratio, with 90% of the weights from DeepSeek-R1-Distill-Qwen-32B-abliterated and 10% from the Qwen2.5-Coder-32B-Instruct-abliterated model.
18
- **Although it's a simple mix, the model is usable, and no gibberish has appeared**.
19
- This is an experiment. I test the [9:1](https://huggingface.co/huihui-ai/eepSeek-R1-Distill-Qwen-Coder-32B-Fusion-9010), [8:2](https://huggingface.co/huihui-ai/DeepSeek-R1-Distill-Qwen-Coder-32B-Fusion-8020), [7:3](https://huggingface.co/huihui-ai/DeepSeek-R1-Distill-Qwen-Coder-32B-Fusion-7030), [6:4](https://huggingface.co/huihui-ai/DeepSeek-R1-Distill-Qwen-Coder-32B-Fusion-6040) and [5:5](https://huggingface.co/huihui-ai/DeepSeek-R1-Distill-Qwen-Coder-32B-Fusion-5050) ratios separately to see how much impact they have on the model.
20
 
 
 
21
  Improve thinking abilities in programming and code. If any of the models meet your expectations, please give a thumbs up. This will help us finalize which model best meets
22
  everyone's expectations.
23
 
@@ -29,10 +31,6 @@ everyone's expectations.
29
  - **Architecture:** Qwen2.5
30
  - **Mixing Ratio:** 9:1 (DeepSeek-R1-Distill-Qwen-32B-abliterated:Qwen2.5-Coder-32B-Instruct-abliterated)
31
 
32
- ## Key Features
33
- - **DeepSeek-R1-Distill-Qwen-32B-abliterated(90%):** This is an uncensored version of DeepSeek-R1-Distill-Qwen-32B created with abliteration.
34
- - **Qwen2.5-Coder-32B-Instruct-abliterated Contributions (10%):** This is an uncensored version of Qwen2.5-Coder-32B-Instruct created with abliteration.
35
-
36
  ## Usage
37
  You can use this mixed model in your applications by loading it with Hugging Face's `transformers` library:
38
 
 
14
  # DeepSeek-R1-Distill-Qwen-Coder-32B-Fusion-9010
15
 
16
  ## Overview
17
+ `DeepSeek-R1-Distill-Qwen-Coder-32B-Fusion-9010` is a mixed model that combines the strengths of two powerful DeepSeek-R1-Distill-Qwen-based models:
18
+ [huihui-ai/DeepSeek-R1-Distill-Qwen-32B-abliterated](https://huggingface.co/huihui-ai/DeepSeek-R1-Distill-Qwen-32B-abliterated) and
19
+ [huihui-ai/Qwen2.5-Coder-32B-Instruct-abliterated](https://huggingface.co/huihui-ai/Qwen2.5-Coder-32B-Instruct-abliterated).
20
 
21
+ **Although it's a simple mix, the model is usable, and no gibberish has appeared**.
22
+ This is an experiment.
23
  Improve thinking abilities in programming and code. If any of the models meet your expectations, please give a thumbs up. This will help us finalize which model best meets
24
  everyone's expectations.
25
 
 
31
  - **Architecture:** Qwen2.5
32
  - **Mixing Ratio:** 9:1 (DeepSeek-R1-Distill-Qwen-32B-abliterated:Qwen2.5-Coder-32B-Instruct-abliterated)
33
 
 
 
 
 
34
  ## Usage
35
  You can use this mixed model in your applications by loading it with Hugging Face's `transformers` library:
36