helizhi commited on
Commit
dc17d52
·
verified ·
1 Parent(s): 9534590

Upload README.md

Browse files
Files changed (1) hide show
  1. README.md +33 -0
README.md ADDED
@@ -0,0 +1,33 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ # Citrus: Leveraging Expert Cognitive Pathways in a Medical Language Model for Advanced Medical Decision Support
2
+
3
+ <p align="center">
4
+ <a href="https://arxiv.org/abs/2502.18274" target="_blank">📑Paper</a> |<a href="https://jdh-algo.github.io/Citrus/" target="_blank">🤗Github Page</a> |<a href="https://huggingface.co/jdh-algo/Citrus1.0-llama-70B" target="_blank">🤗Model</a> |<a href="https://huggingface.co/datasets/jdh-algo/Citrus_S3" target="_blank">📚Medical Reasoning Data</a> | <a href="https://huggingface.co/datasets/jdh-algo/JMED" target="_blank">📚Evaluation Data</a>
5
+ </p>
6
+
7
+ ## The Introduction to Our Work
8
+ ### 1. Main approaches
9
+ <div align="center">
10
+ <img src="https://raw.githubusercontent.com/jdh-algo/Citrus/main/static/images/figure4-1-2.png" alt="image" width="75%"/>
11
+ </div>
12
+
13
+
14
+ ### 2. Overview of training stages and training data pipeline
15
+
16
+ <div align="center">
17
+ <img src="https://raw.githubusercontent.com/jdh-algo/Citrus/main/static/images/figure4-2-1.png" width="75%">
18
+ </div>
19
+ Citrus is a medical language model that bridges the gap between clinical expertise and AI reasoning by emulating the cognitive processes of medical experts. The model is trained on a large corpus of simulated expert disease reasoning data in sft-stage-3, synthesized using a novel approach that accurately captures the decision-making pathways of clinicians.
20
+
21
+ The contributions of this work are as follows:
22
+
23
+ 1. We propose a training-free reasoning approach that emulates the cognitive processes of medical experts, enabling large language models to enhance their medical capabilities in clinical diagnosis and treatment.
24
+
25
+ 2. In conjunction with the data construction method, we introduce a multi-stage post-training approach to further improve the model’s medical performance.
26
+
27
+ 3. We have made the Citrus model and its training data publicly available as open-source resources to advance research in AI-driven medical decision-making.
28
+
29
+ 4. We have developed and open-sourced a large-scale, updatable clinical practice evaluation dataset based on real-world data, accurately reflecting the distribution of patients in real-world settings.
30
+
31
+ ## Notice
32
+ 1. Our model is built with Qwen2.5-72B, Qwen is licensed under the Qwen LICENSE AGREEMENT, Copyright (c) Alibaba Cloud. All Rights Reserved.
33
+ 2. Our default license is MIT, provided it does not conflict with the Qwen LICENSE.