FangDai committed
Commit ee8226c (verified)
1 Parent(s): ed70297

Update README.md

  1. README.md +37 -2
README.md CHANGED
@@ -40,7 +40,42 @@ This project uses requirements.txt.
40
  $ pip install -r requirements.txt
41
  ```
42
  ## Datasets
43
- 1. Thyroid dataset for Tiger Model (The other external validation datasets(BrEaST, BUSI, VinDr-PCXR) are also deployed in folders in the same manner.)
44
  ```sh
45
  ├─dataset
46
  └─training data
@@ -55,7 +90,7 @@ $ pip install -r requirements.txt
55
  20191101_094744_1.png
56
  ... ...
57
  ```
58
- 2. Thyroid dataset for Resnet Model (The other external validation datasets(BrEaST, BUSI, VinDr-PCXR) are also deployed in folders in the same manner.)
59
  ```sh
60
  ├─dataset
61
  │ ├─Renset training data
 
40
  $ pip install -r requirements.txt
41
  ```
42
  ## Datasets
43
+ ### 1. Thyroid dataset for Tiger Model (the other external validation datasets (BrEaST, BUSI, VinDr-PCXR) are organized in folders in the same manner)
44
+
45
+ We have shared part of the thyroid ultrasound dataset for verification. Please refer to the article cited below for other studies using this dataset.
46
+ If you use this dataset in your research, please cite the reference listed under Citation below.
47
+ A portion of the data from this article is publicly available on Hugging Face ([https://huggingface.co/datasets/FangDai/Thyroid_Ultrasound_Images](https://huggingface.co/datasets/FangDai/Thyroid_Ultrasound_Images)). To download this dataset, you must register on Hugging Face and sign our data usage application before gaining access.
48
+ ![image4](https://github.com/fangdai-dear/QuasiParetoImprovement/blob/master/scripts/Figure/image4.png)
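As a rough sketch of the download step described above, the snippet below wraps `huggingface_hub.snapshot_download` for the gated dataset. It assumes that access has already been granted and that you hold a personal access token. The helper names and the `data/thyroid` target folder are hypothetical and are not part of this repository.

```python
def build_download_kwargs(token, local_dir="data/thyroid"):
    """Collect arguments for huggingface_hub.snapshot_download.

    local_dir is a hypothetical target folder, not a path this repo requires.
    """
    if not token:
        raise ValueError("A Hugging Face access token is required for gated datasets.")
    return {
        "repo_id": "FangDai/Thyroid_Ultrasound_Images",
        "repo_type": "dataset",  # the repository is a dataset, not a model
        "token": token,
        "local_dir": local_dir,
    }


def download_dataset(token, local_dir="data/thyroid"):
    """Download the dataset snapshot and return the local folder path."""
    # Imported lazily so build_download_kwargs works without the package installed.
    from huggingface_hub import snapshot_download

    return snapshot_download(**build_download_kwargs(token, local_dir))
```

Calling `download_dataset(token)` with a valid token should place the files under the chosen folder. The request is expected to fail with an authorization error if the data usage application has not been approved for the account that owns the token.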
49
+
50
+ Please read the following information for data usage permissions and the conditions for accessing the full dataset.
51
+ ```
52
+ All data supporting the findings can be found within the article and the Supplementary Information. The thyroid datasets trained on and analyzed during this study are available in a deidentified form to protect patient privacy. The minimum thyroid dataset required to interpret, verify, and extend the findings of this study has been deposited on Hugging Face under accession code https://huggingface.co/datasets/FangDai/Thyroid_Ultrasound_Images. This includes:
53
+ - Pre-processed imaging data (ultrasound images with anonymized metadata).
54
+ - Clinical feature tables (age, gender, tumor size) with all direct identifiers removed.
55
+ Due to ethical restrictions and patient confidentiality agreements, the full dataset (e.g., raw imaging data, detailed clinical records) cannot be made publicly available. This pertains to detailed clinical records and high-resolution imaging data that, even after de-identification, may pose a risk of re-identification given the unique characteristics of thyroid cancer cases. Researchers who wish to access additional data for non-commercial academic purposes may submit a formal request to the corresponding author. Requests will be reviewed by the institutional ethics committee and data custodians. The following conditions apply:
56
+ - Purpose: Data will only be shared for research purposes that align with the original study objectives.
57
+ - Access Restrictions: Requesters must sign a data use agreement prohibiting re-identification or redistribution.
58
+ - Data Retention: Approved data will be available for 2 years from the date of publication.
59
+ ```
60
+ This dataset contains 900 thyroid ultrasound images, categorized into three subtypes of thyroid carcinoma:
61
+ - PTC (Papillary Thyroid Carcinoma)
62
+ - FTC (Follicular Thyroid Carcinoma)
63
+ - MTC (Medullary Thyroid Carcinoma)
64
+ The dataset is curated to support medical image classification and segmentation tasks, particularly for deep learning applications in thyroid cancer diagnosis.
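A quick sanity check after downloading is to count the images per subtype. The sketch below assumes a hypothetical layout with one folder per class (`PTC/`, `FTC/`, `MTC/`) containing `.png` files. The actual folder structure of the released dataset may differ, so adjust the paths accordingly.

```python
from collections import Counter
from pathlib import Path

# Class labels taken from the list above.
SUBTYPES = ("PTC", "FTC", "MTC")


def count_images_per_subtype(root):
    """Count .png files in root/<SUBTYPE>/ folders (hypothetical layout)."""
    counts = Counter()
    for subtype in SUBTYPES:
        folder = Path(root) / subtype
        # A missing class folder is reported as zero images rather than an error.
        counts[subtype] = sum(1 for _ in folder.glob("*.png")) if folder.is_dir() else 0
    return counts
```

If the layout assumption holds for the shared subset, the per-class counts should sum to the 900 images stated above.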
66
+ #### Citation
67
+ ```bibtex
68
+ @article{yao2024enhancing,
69
+ title={Enhancing the fairness of AI prediction models by Quasi-Pareto improvement among heterogeneous thyroid nodule population},
70
+ author={Yao, Siqiong and Dai, Fang and Sun, Peng and Zhang, Weituo and Qian, Biyun and Lu, Hui},
71
+ journal={Nature Communications},
72
+ volume={15},
73
+ number={1},
74
+ pages={1958},
75
+ year={2024},
76
+ publisher={Nature Publishing Group UK London}
77
+ }
78
+ ```
79
  ```sh
80
  ├─dataset
81
  └─training data
 
90
  20191101_094744_1.png
91
  ... ...
92
  ```
93
+ ### 2. Thyroid dataset for ResNet Model (the other external validation datasets (BrEaST, BUSI, VinDr-PCXR) are organized in folders in the same manner)
94
  ```sh
95
  ├─dataset
96
  │ ├─Renset training data