Text Classification
English
medical
yishuwei2019 commited on
Commit
908d198
·
verified ·
1 Parent(s): 552eea6

Upload README.md

Browse files
Files changed (1) hide show
  1. README.md +23 -2
README.md CHANGED
@@ -10,5 +10,26 @@ tags:
10
  ---
11
 
12
 
13
- ## RadReportX
14
- ### Model description
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
10
  ---
11
 
12
 
13
+ # RadReportX
14
+ ### Model description
15
+ Llama3.1-8B-instruct model fine tuned on synthetic data. There are two tasks of that this model can achieve. The first task is an open-ended question, which is to detect phrases in a radiology report that represents an ICD-10 code. There is no restriction about the underlying disease. The second task is to detect disease out of 14 candidates from a radiology report. The candidate diseases are [Atelectasis, Cardiomegaly, Consolidation, Edema, Enlarged Cardiomediastinum, Fracture, Lung Lesion, Lung Opacity, Pleural Effusion, Pleural Other, Pneumonia, Pneumothorax, Support Devices]. When there are no diseases out of the candidates, the model will output 'Normal'.
16
+
17
+ ### Training set and training process
18
+ There are two sources of training data. The first set is generated by GPT4o. The second source comes from MIMIC-CXR dataset (https://arxiv.org/pdf/1901.07042), with labels being extracted by Negbio algorithm. The training is conducted using torchtune framework (https://github.com/pytorch/torchtune). For details, please refer to our paper listed below.
19
+
20
+ ### How to use
21
+ Please refer to https://github.com/bionlplab/RadReportX
22
+
23
+ ### Paper
24
+ https://arxiv.org/pdf/2409.16563
25
+
26
+ ### Citation
27
+ @article{wei2024enhancing,
28
+ title={Enhancing disease detection in radiology reports through fine-tuning lightweight LLM on weak labels},
29
+ author={Wei, Yishu and Wang, Xindi and Ong, Hanley and Zhou, Yiliang and Flanders, Adam and Shih, George and Peng, Yifan},
30
+ journal={arXiv preprint arXiv:2409.16563},
31
+ year={2024}
32
+ }
33
+
34
+ ### Disclaimer
35
+ This tool shows the results of research conducted in the Computational Biology Branch, NCBI. The information produced on this website is not intended for direct diagnostic use or medical decision-making without review and oversight by a clinical professional. Individuals should not change their health behavior solely on the basis of information produced on this website. NIH does not independently verify the validity or utility of the information produced by this tool. If you have questions about the information produced on this website, please see a health care professional. More information about NCBI's disclaimer policy is available.