YanBoChen commited on
Commit
d72142e
·
1 Parent(s): 8de0937

feat: add emergency subset analysis plots and statistics report

Browse files
dataset/analysis/stats/analysis_stats_emergency_subset_opt.json ADDED
@@ -0,0 +1,55 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "basic_statistics": {
3
+ "total_records": 11914,
4
+ "avg_length": 23847.07579318449
5
+ },
6
+ "keyword_statistics": {
7
+ "Acute abdomen": 52,
8
+ "Acute bleeding": 31,
9
+ "Acute Coronary Syndrome": 351,
10
+ "Acute Kidney Injury": 202,
11
+ "Acute pancreatitis": 214,
12
+ "Acute respiratory distress syndrome": 231,
13
+ "Acute stroke": 67,
14
+ "Anaphylaxis": 1016,
15
+ "Anaphylactic Shock": 153,
16
+ "Arrhythmia": 1564,
17
+ "Atrial fibrillation": 771,
18
+ "Atrial flutter": 146,
19
+ "Bradycardia": 884,
20
+ "Cardiac arrest": 614,
21
+ "Cardiogenic Shock": 196,
22
+ "Chest pain": 1434,
23
+ "Dyspnea": 1319,
24
+ "Fever": 4279,
25
+ "Gastrointestinal Hemorrhage": 158,
26
+ "GI bleeding": 105,
27
+ "Hemorrhage": 1621,
28
+ "Hemorrhagic stroke": 117,
29
+ "Hyperthermia": 305,
30
+ "Hypovolemic Shock": 63,
31
+ "Hypotension": 1929,
32
+ "Hypothermia": 356,
33
+ "Internal bleeding": 70,
34
+ "Intracranial Hemorrhages": 6,
35
+ "Ischemic stroke": 225,
36
+ "Loss of consciousness": 422,
37
+ "Myocardial Infarction": 1710,
38
+ "MI": 11773,
39
+ "Pulmonary Edema": 487,
40
+ "Pulmonary Embolism": 654,
41
+ "Respiratory distress": 730,
42
+ "Respiratory failure": 579,
43
+ "Sepsis": 1188,
44
+ "Severe Sepsis": 81,
45
+ "Septic Shock": 244,
46
+ "Shock": 1892,
47
+ "Status Epilepticus": 150,
48
+ "Syncope": 834,
49
+ "Tachycardia": 1651,
50
+ "Tachypnea": 268,
51
+ "Traumatic Brain Injury": 171,
52
+ "Ventricular Tachycardia": 492,
53
+ "Ventricular fibrillation": 295
54
+ }
55
+ }
dataset/analysis/subset_comparison_first10_records_20250726_163149.md ADDED
@@ -0,0 +1,198 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ # Optimized Subsets Comparison Report
2
+
3
+ Generated on: 2025-07-26 16:31:49
4
+
5
+ File format: CSV
6
+
7
+
8
+ ## Basic Statistics
9
+
10
+ - Emergency subset total records: 11914
11
+ - Emergency+Treatment subset total records: 11023
12
+ - Avg Emergency Text Length: 23847.08
13
+ - Avg Treatment Text Length: 25408.64
14
+ - Avg Emergency Keywords: 2.85
15
+ - Avg Treatment Keywords: 2.97
16
+
17
+ ## Emergency Subset (First 10 Records)
18
+
19
+
20
+ ### Record 1
21
+ ```
22
+ Text preview: # Section 1: Recommendations
23
+
24
+ # RECOMMENDATIONS Recommendation 1: General Measures Committee Respons...
25
+ Matched keywords: shock
26
+ Text length: 37792
27
+ ```
28
+
29
+
30
+ ### Record 2
31
+ ```
32
+ Text preview: Evidence-based Series 4-9 Version 2 A Quality Initiative of the Program in Evidence-based Care (PEBC...
33
+ Matched keywords: hemorrhage
34
+ Text length: 7559
35
+ ```
36
+
37
+
38
+ ### Record 3
39
+ ```
40
+ Text preview: Neuroendocrine tumours (NETs) constitute a heterogeneous group of neoplasms: they include epithelial...
41
+ Matched keywords: ards|pulmonary embolism
42
+ Text length: 11731
43
+ ```
44
+
45
+
46
+ ### Record 4
47
+ ```
48
+ Text preview: Given the potential toxicities associated with alemtuzumab, and given the limited nature of the clin...
49
+ Matched keywords: fever|dyspnea|hypotension|sepsis
50
+ Text length: 46087
51
+ ```
52
+
53
+
54
+ ### Record 5
55
+ ```
56
+ Text preview: Although the incidence and mortality of gastric cancer has been steadily decreasing in Canadian men ...
57
+ Matched keywords: hyperthermia
58
+ Text length: 35302
59
+ ```
60
+
61
+
62
+ ### Record 6
63
+ ```
64
+ Text preview: There are various definitions for palliative care, but most people would agree that "it focuses on c...
65
+ Matched keywords: hemorrhage|dyspnea
66
+ Text length: 16186
67
+ ```
68
+
69
+
70
+ ### Record 7
71
+ ```
72
+ Text preview: # GUIDELINE OBJECTIVES
73
+ The objective of this guideline is to update a previous guideline on chemothe...
74
+ Matched keywords: hemorrhage
75
+ Text length: 7551
76
+ ```
77
+
78
+
79
+ ### Record 8
80
+ ```
81
+ Text preview: Anthracyclines have been established to be superior to some non-anthracycline chemotherapy regimens ...
82
+ Matched keywords: mi
83
+ Text length: 50729
84
+ ```
85
+
86
+
87
+ ### Record 9
88
+ ```
89
+ Text preview: # GUIDELINE OBJECTIVE
90
+ This guideline was written to provide guidance on the most appropriate follow-...
91
+ Matched keywords: hemorrhage
92
+ Text length: 4299
93
+ ```
94
+
95
+
96
+ ### Record 10
97
+ ```
98
+ Text preview: PDT is a local treatment. It utilizes the local, selective, cytotoxic reaction produced by photosens...
99
+ Matched keywords: dyspnea|mi|hemorrhage|respiratory_failure|cva|hypotension|sepsis|ards
100
+ Text length: 54427
101
+ ```
102
+
103
+
104
+ ## Emergency+Treatment Subset (First 10 Records)
105
+
106
+
107
+ ### Record 1
108
+ ```
109
+ Text preview: # Section 1: Recommendations
110
+
111
+ # RECOMMENDATIONS Recommendation 1: General Measures Committee Respons...
112
+ Emergency keywords: shock
113
+ Treatment keywords: management|medication|procedure|fluid|monitoring|iv|administer|dose
114
+ Text length: 37792
115
+ ```
116
+
117
+
118
+ ### Record 2
119
+ ```
120
+ Text preview: Evidence-based Series 4-9 Version 2 A Quality Initiative of the Program in Evidence-based Care (PEBC...
121
+ Emergency keywords: hemorrhage
122
+ Treatment keywords: Therapy|treatment|x-ray|us|ct
123
+ Text length: 7559
124
+ ```
125
+
126
+
127
+ ### Record 3
128
+ ```
129
+ Text preview: Neuroendocrine tumours (NETs) constitute a heterogeneous group of neoplasms: they include epithelial...
130
+ Emergency keywords: ards|pulmonary embolism
131
+ Treatment keywords: dopamine|therapy|treatment|surgery|iv|intervention|dose
132
+ Text length: 11731
133
+ ```
134
+
135
+
136
+ ### Record 4
137
+ ```
138
+ Text preview: Given the potential toxicities associated with alemtuzumab, and given the limited nature of the clin...
139
+ Emergency keywords: fever|dyspnea|hypotension|sepsis
140
+ Treatment keywords: treatment|iv|therapy|treat|management|intervention|supportive care|dose
141
+ Text length: 46087
142
+ ```
143
+
144
+
145
+ ### Record 5
146
+ ```
147
+ Text preview: Although the incidence and mortality of gastric cancer has been steadily decreasing in Canadian men ...
148
+ Emergency keywords: hyperthermia
149
+ Treatment keywords: surgery|treatment|therapy|treat|dose|ct
150
+ Text length: 35302
151
+ ```
152
+
153
+
154
+ ### Record 6
155
+ ```
156
+ Text preview: There are various definitions for palliative care, but most people would agree that "it focuses on c...
157
+ Emergency keywords: hemorrhage|dyspnea
158
+ Treatment keywords: therapy|management|treatment|morphine|dose
159
+ Text length: 16186
160
+ ```
161
+
162
+
163
+ ### Record 7
164
+ ```
165
+ Text preview: # GUIDELINE OBJECTIVES
166
+ The objective of this guideline is to update a previous guideline on chemothe...
167
+ Emergency keywords: hemorrhage
168
+ Treatment keywords: therapy|treatment|surgery
169
+ Text length: 7551
170
+ ```
171
+
172
+
173
+ ### Record 8
174
+ ```
175
+ Text preview: Anthracyclines have been established to be superior to some non-anthracycline chemotherapy regimens ...
176
+ Emergency keywords: mi
177
+ Treatment keywords: iv|Dose|therapy|administer|surgery|treatment|treat|medication|ecg
178
+ Text length: 50729
179
+ ```
180
+
181
+
182
+ ### Record 9
183
+ ```
184
+ Text preview: # GUIDELINE OBJECTIVE
185
+ This guideline was written to provide guidance on the most appropriate follow-...
186
+ Emergency keywords: hemorrhage
187
+ Treatment keywords: treatment|ct
188
+ Text length: 4299
189
+ ```
190
+
191
+
192
+ ### Record 10
193
+ ```
194
+ Text preview: PDT is a local treatment. It utilizes the local, selective, cytotoxic reaction produced by photosens...
195
+ Emergency keywords: dyspnea|mi|hemorrhage|respiratory_failure|cva|hypotension|sepsis|ards
196
+ Treatment keywords: treatment|oxygen|iv|dose|therapy|surgery|x-ray|administer|procedure|management
197
+ Text length: 54427
198
+ ```
dataset/analysis/subset_comparison_first10_records_20250726_163158.md ADDED
@@ -0,0 +1,198 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ # Optimized Subsets Comparison Report
2
+
3
+ Generated on: 2025-07-26 16:31:58
4
+
5
+ File format: JSONL
6
+
7
+
8
+ ## Basic Statistics
9
+
10
+ - Emergency subset total records: 11914
11
+ - Emergency+Treatment subset total records: 11023
12
+ - Avg Emergency Text Length: 23847.08
13
+ - Avg Treatment Text Length: 25408.64
14
+ - Avg Emergency Keywords: 2.85
15
+ - Avg Treatment Keywords: 2.97
16
+
17
+ ## Emergency Subset (First 10 Records)
18
+
19
+
20
+ ### Record 1
21
+ ```
22
+ Text preview: # Section 1: Recommendations
23
+
24
+ # RECOMMENDATIONS Recommendation 1: General Measures Committee Respons...
25
+ Matched keywords: shock
26
+ Text length: 37792
27
+ ```
28
+
29
+
30
+ ### Record 2
31
+ ```
32
+ Text preview: Evidence-based Series 4-9 Version 2 A Quality Initiative of the Program in Evidence-based Care (PEBC...
33
+ Matched keywords: hemorrhage
34
+ Text length: 7559
35
+ ```
36
+
37
+
38
+ ### Record 3
39
+ ```
40
+ Text preview: Neuroendocrine tumours (NETs) constitute a heterogeneous group of neoplasms: they include epithelial...
41
+ Matched keywords: ards|pulmonary embolism
42
+ Text length: 11731
43
+ ```
44
+
45
+
46
+ ### Record 4
47
+ ```
48
+ Text preview: Given the potential toxicities associated with alemtuzumab, and given the limited nature of the clin...
49
+ Matched keywords: fever|dyspnea|hypotension|sepsis
50
+ Text length: 46087
51
+ ```
52
+
53
+
54
+ ### Record 5
55
+ ```
56
+ Text preview: Although the incidence and mortality of gastric cancer has been steadily decreasing in Canadian men ...
57
+ Matched keywords: hyperthermia
58
+ Text length: 35302
59
+ ```
60
+
61
+
62
+ ### Record 6
63
+ ```
64
+ Text preview: There are various definitions for palliative care, but most people would agree that "it focuses on c...
65
+ Matched keywords: hemorrhage|dyspnea
66
+ Text length: 16186
67
+ ```
68
+
69
+
70
+ ### Record 7
71
+ ```
72
+ Text preview: # GUIDELINE OBJECTIVES
73
+ The objective of this guideline is to update a previous guideline on chemothe...
74
+ Matched keywords: hemorrhage
75
+ Text length: 7551
76
+ ```
77
+
78
+
79
+ ### Record 8
80
+ ```
81
+ Text preview: Anthracyclines have been established to be superior to some non-anthracycline chemotherapy regimens ...
82
+ Matched keywords: mi
83
+ Text length: 50729
84
+ ```
85
+
86
+
87
+ ### Record 9
88
+ ```
89
+ Text preview: # GUIDELINE OBJECTIVE
90
+ This guideline was written to provide guidance on the most appropriate follow-...
91
+ Matched keywords: hemorrhage
92
+ Text length: 4299
93
+ ```
94
+
95
+
96
+ ### Record 10
97
+ ```
98
+ Text preview: PDT is a local treatment. It utilizes the local, selective, cytotoxic reaction produced by photosens...
99
+ Matched keywords: dyspnea|mi|hemorrhage|respiratory_failure|cva|hypotension|sepsis|ards
100
+ Text length: 54427
101
+ ```
102
+
103
+
104
+ ## Emergency+Treatment Subset (First 10 Records)
105
+
106
+
107
+ ### Record 1
108
+ ```
109
+ Text preview: # Section 1: Recommendations
110
+
111
+ # RECOMMENDATIONS Recommendation 1: General Measures Committee Respons...
112
+ Emergency keywords: shock
113
+ Treatment keywords: management|medication|procedure|fluid|monitoring|iv|administer|dose
114
+ Text length: 37792
115
+ ```
116
+
117
+
118
+ ### Record 2
119
+ ```
120
+ Text preview: Evidence-based Series 4-9 Version 2 A Quality Initiative of the Program in Evidence-based Care (PEBC...
121
+ Emergency keywords: hemorrhage
122
+ Treatment keywords: Therapy|treatment|x-ray|us|ct
123
+ Text length: 7559
124
+ ```
125
+
126
+
127
+ ### Record 3
128
+ ```
129
+ Text preview: Neuroendocrine tumours (NETs) constitute a heterogeneous group of neoplasms: they include epithelial...
130
+ Emergency keywords: ards|pulmonary embolism
131
+ Treatment keywords: dopamine|therapy|treatment|surgery|iv|intervention|dose
132
+ Text length: 11731
133
+ ```
134
+
135
+
136
+ ### Record 4
137
+ ```
138
+ Text preview: Given the potential toxicities associated with alemtuzumab, and given the limited nature of the clin...
139
+ Emergency keywords: fever|dyspnea|hypotension|sepsis
140
+ Treatment keywords: treatment|iv|therapy|treat|management|intervention|supportive care|dose
141
+ Text length: 46087
142
+ ```
143
+
144
+
145
+ ### Record 5
146
+ ```
147
+ Text preview: Although the incidence and mortality of gastric cancer has been steadily decreasing in Canadian men ...
148
+ Emergency keywords: hyperthermia
149
+ Treatment keywords: surgery|treatment|therapy|treat|dose|ct
150
+ Text length: 35302
151
+ ```
152
+
153
+
154
+ ### Record 6
155
+ ```
156
+ Text preview: There are various definitions for palliative care, but most people would agree that "it focuses on c...
157
+ Emergency keywords: hemorrhage|dyspnea
158
+ Treatment keywords: therapy|management|treatment|morphine|dose
159
+ Text length: 16186
160
+ ```
161
+
162
+
163
+ ### Record 7
164
+ ```
165
+ Text preview: # GUIDELINE OBJECTIVES
166
+ The objective of this guideline is to update a previous guideline on chemothe...
167
+ Emergency keywords: hemorrhage
168
+ Treatment keywords: therapy|treatment|surgery
169
+ Text length: 7551
170
+ ```
171
+
172
+
173
+ ### Record 8
174
+ ```
175
+ Text preview: Anthracyclines have been established to be superior to some non-anthracycline chemotherapy regimens ...
176
+ Emergency keywords: mi
177
+ Treatment keywords: iv|Dose|therapy|administer|surgery|treatment|treat|medication|ecg
178
+ Text length: 50729
179
+ ```
180
+
181
+
182
+ ### Record 9
183
+ ```
184
+ Text preview: # GUIDELINE OBJECTIVE
185
+ This guideline was written to provide guidance on the most appropriate follow-...
186
+ Emergency keywords: hemorrhage
187
+ Treatment keywords: treatment|ct
188
+ Text length: 4299
189
+ ```
190
+
191
+
192
+ ### Record 10
193
+ ```
194
+ Text preview: PDT is a local treatment. It utilizes the local, selective, cytotoxic reaction produced by photosens...
195
+ Emergency keywords: dyspnea|mi|hemorrhage|respiratory_failure|cva|hypotension|sepsis|ards
196
+ Treatment keywords: treatment|oxygen|iv|dose|therapy|surgery|x-ray|administer|procedure|management
197
+ Text length: 54427
198
+ ```