Update README.md
Browse files
README.md
CHANGED
@@ -14,10 +14,12 @@ language:
|
|
14 |
# DeepSeek-R1-Distill-Qwen-Coder-32B-Fusion-9010
|
15 |
|
16 |
## Overview
|
17 |
-
`DeepSeek-R1-Distill-Qwen-Coder-32B-Fusion-9010` is a mixed model that combines the strengths of two powerful DeepSeek-R1-Distill-Qwen-based models:
|
18 |
-
|
19 |
-
|
20 |
|
|
|
|
|
21 |
Improve thinking abilities in programming and code. If any of the models meet your expectations, please give a thumbs up. This will help us finalize which model best meets
|
22 |
everyone's expectations.
|
23 |
|
@@ -29,10 +31,6 @@ everyone's expectations.
|
|
29 |
- **Architecture:** Qwen2.5
|
30 |
- **Mixing Ratio:** 9:1 (DeepSeek-R1-Distill-Qwen-32B-abliterated:Qwen2.5-Coder-32B-Instruct-abliterated)
|
31 |
|
32 |
-
## Key Features
|
33 |
-
- **DeepSeek-R1-Distill-Qwen-32B-abliterated(90%):** This is an uncensored version of DeepSeek-R1-Distill-Qwen-32B created with abliteration.
|
34 |
-
- **Qwen2.5-Coder-32B-Instruct-abliterated Contributions (10%):** This is an uncensored version of Qwen2.5-Coder-32B-Instruct created with abliteration.
|
35 |
-
|
36 |
## Usage
|
37 |
You can use this mixed model in your applications by loading it with Hugging Face's `transformers` library:
|
38 |
|
|
|
14 |
# DeepSeek-R1-Distill-Qwen-Coder-32B-Fusion-9010
|
15 |
|
16 |
## Overview
|
17 |
+
`DeepSeek-R1-Distill-Qwen-Coder-32B-Fusion-9010` is a mixed model that combines the strengths of two powerful DeepSeek-R1-Distill-Qwen-based models:
|
18 |
+
[huihui-ai/DeepSeek-R1-Distill-Qwen-32B-abliterated](https://huggingface.co/huihui-ai/DeepSeek-R1-Distill-Qwen-32B-abliterated) and
|
19 |
+
[huihui-ai/Qwen2.5-Coder-32B-Instruct-abliterated](https://huggingface.co/huihui-ai/Qwen2.5-Coder-32B-Instruct-abliterated).
|
20 |
|
21 |
+
**Although it's a simple mix, the model is usable, and no gibberish has appeared**.
|
22 |
+
This is an experiment.
|
23 |
Improve thinking abilities in programming and code. If any of the models meet your expectations, please give a thumbs up. This will help us finalize which model best meets
|
24 |
everyone's expectations.
|
25 |
|
|
|
31 |
- **Architecture:** Qwen2.5
|
32 |
- **Mixing Ratio:** 9:1 (DeepSeek-R1-Distill-Qwen-32B-abliterated:Qwen2.5-Coder-32B-Instruct-abliterated)
|
33 |
|
|
|
|
|
|
|
|
|
34 |
## Usage
|
35 |
You can use this mixed model in your applications by loading it with Hugging Face's `transformers` library:
|
36 |
|