Upload README.md
Browse files
README.md
ADDED
@@ -0,0 +1,33 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
1 |
+
# Citrus: Leveraging Expert Cognitive Pathways in a Medical Language Model for Advanced Medical Decision Support
|
2 |
+
|
3 |
+
<p align="center">
|
4 |
+
<a href="https://arxiv.org/abs/2502.18274" target="_blank">📑Paper</a> |<a href="https://jdh-algo.github.io/Citrus/" target="_blank">🤗Github Page</a> |<a href="https://huggingface.co/jdh-algo/Citrus1.0-llama-70B" target="_blank">🤗Model</a> |<a href="https://huggingface.co/datasets/jdh-algo/Citrus_S3" target="_blank">📚Medical Reasoning Data</a> | <a href="https://huggingface.co/datasets/jdh-algo/JMED" target="_blank">📚Evaluation Data</a>
|
5 |
+
</p>
|
6 |
+
|
7 |
+
## The Introduction to Our Work
|
8 |
+
### 1. Main approaches
|
9 |
+
<div align="center">
|
10 |
+
<img src="https://raw.githubusercontent.com/jdh-algo/Citrus/main/static/images/figure4-1-2.png" alt="image" width="75%"/>
|
11 |
+
</div>
|
12 |
+
|
13 |
+
|
14 |
+
### 2. Overview of training stages and training data pipeline
|
15 |
+
|
16 |
+
<div align="center">
|
17 |
+
<img src="https://raw.githubusercontent.com/jdh-algo/Citrus/main/static/images/figure4-2-1.png" width="75%">
|
18 |
+
</div>
|
19 |
+
Citrus is a medical language model that bridges the gap between clinical expertise and AI reasoning by emulating the cognitive processes of medical experts. The model is trained on a large corpus of simulated expert disease reasoning data in sft-stage-3, synthesized using a novel approach that accurately captures the decision-making pathways of clinicians.
|
20 |
+
|
21 |
+
The contributions of this work are as follows:
|
22 |
+
|
23 |
+
1. We propose a training-free reasoning approach that emulates the cognitive processes of medical experts, enabling large language models to enhance their medical capabilities in clinical diagnosis and treatment.
|
24 |
+
|
25 |
+
2. In conjunction with the data construction method, we introduce a multi-stage post-training approach to further improve the model’s medical performance.
|
26 |
+
|
27 |
+
3. We have made the Citrus model and its training data publicly available as open-source resources to advance research in AI-driven medical decision-making.
|
28 |
+
|
29 |
+
4. We have developed and open-sourced a large-scale, updatable clinical practice evaluation dataset based on real-world data, accurately reflecting the distribution of patients in real-world settings.
|
30 |
+
|
31 |
+
## Notice
|
32 |
+
1. Our model is built with Qwen2.5-72B, Qwen is licensed under the Qwen LICENSE AGREEMENT, Copyright (c) Alibaba Cloud. All Rights Reserved.
|
33 |
+
2. Our default license is MIT, provided it does not conflict with the Qwen LICENSE.
|