Minh141120 commited on
Commit
15c639a
·
verified ·
1 Parent(s): cd0e3d5

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +12 -12
README.md CHANGED
@@ -1,36 +1,36 @@
1
-
2
  ---
3
  license: mit
4
  ---
5
-
6
  ## Overview
7
 
8
- The 'test-deepseek-r1-distill-qwen-1.5b' model is a distilled variant of the DeepSeek architecture, designed to provide efficient natural language understanding and generation capabilities. It leverages the advancements from 'deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B', optimizing performance while reducing computational requirements. This model targets a range of applications, including conversational AI, content generation, and text summarization, making it ideal for integrating into chatbots, virtual assistants, and automated writing tools. Benchmarked for performance, it exhibits strong capabilities in maintaining context, generating coherent responses, and understanding nuanced queries. Overall, it serves as a lightweight yet powerful solution for developers seeking an effective language model for diverse tasks.
 
 
9
 
10
  ## Variants
11
 
12
  | No | Variant | Cortex CLI command |
13
  | --- | --- | --- |
14
- | 1 | [gguf](https://huggingface.co/cortexso/deepseek-r1-distill-qwen-1.5b/tree/main) | cortex run deepseek-r1-distill-qwen-1.5b |
 
15
 
16
  ## Use it with Jan (UI)
17
 
18
  1. Install **Jan** using [Quickstart](https://jan.ai/docs/quickstart)
19
  2. Use in Jan model Hub:
20
-
21
  cortexso/deepseek-r1-distill-qwen-1.5b
22
-
23
-
24
  ## Use it with Cortex (CLI)
25
 
26
  1. Install **Cortex** using [Quickstart](https://cortex.jan.ai/docs/quickstart)
27
  2. Run the model with command:
28
-
29
  cortex run deepseek-r1-distill-qwen-1.5b
30
-
31
-
32
  ## Credits
33
 
34
- - **Author:** deepseek-ai
35
  - **Converter:** [Homebrew](https://www.homebrew.ltd/)
36
- - **Original License:** [License](https://huggingface.co/cortexso/deepseek-r1-distill-qwen-1.5b#license)
 
 
 
1
  ---
2
  license: mit
3
  ---
 
4
  ## Overview
5
 
6
+ **DeepSeek** developed and released the [DeepSeek R1 Distill Qwen 1.5B](https://huggingface.co/deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B) model, a distilled version of the Qwen 1.5B language model. It is fine-tuned for high-performance text generation and optimized for dialogue and information-seeking tasks. This model achieves a balance of efficiency and accuracy while maintaining a smaller footprint compared to the original Qwen 1.5B.
7
+
8
+ The model is designed for applications in customer support, conversational AI, and research, prioritizing both helpfulness and safety.
9
 
10
  ## Variants
11
 
12
  | No | Variant | Cortex CLI command |
13
  | --- | --- | --- |
14
+ | 1 | [gguf](https://huggingface.co/cortexso/deepseek-r1-distill-qwen-1.5b/tree/main) | `cortex run deepseek-r1-distill-qwen-1.5b` |
15
+
16
 
17
  ## Use it with Jan (UI)
18
 
19
  1. Install **Jan** using [Quickstart](https://jan.ai/docs/quickstart)
20
  2. Use in Jan model Hub:
21
+ ```text
22
  cortexso/deepseek-r1-distill-qwen-1.5b
23
+ ```
 
24
  ## Use it with Cortex (CLI)
25
 
26
  1. Install **Cortex** using [Quickstart](https://cortex.jan.ai/docs/quickstart)
27
  2. Run the model with command:
28
+ ```bash
29
  cortex run deepseek-r1-distill-qwen-1.5b
30
+ ```
 
31
  ## Credits
32
 
33
+ - **Author:** DeepSeek
34
  - **Converter:** [Homebrew](https://www.homebrew.ltd/)
35
+ - **Original License:** [License](https://huggingface.co/deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B#7-license)
36
+ - **Papers:** [DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning](https://arxiv.org/html/2501.12948v1)