Update README.md
Browse files
README.md
CHANGED
@@ -6,9 +6,21 @@ datasets:
|
|
6 |
- StanfordAIMI/interpret-cxr-test-hidden
|
7 |
---
|
8 |
|
9 |
-
#
|
10 |
-
|
11 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
12 |
|
13 |
|
14 |
|
|
|
6 |
- StanfordAIMI/interpret-cxr-test-hidden
|
7 |
---
|
8 |
|
9 |
+
# CXRMate-RRG4
|
10 |
+
|
11 |
+
This is an evolution of https://huggingface.co/aehrc/cxrmate developed for the Radiology Report Generation task of BioNLP @ ACL 2024.
|
12 |
+
|
13 |
+
For this, proposed EAST: Entropy-Augmented Self-critical sequence Training (EAST).
|
14 |
+
EAST modifies Self-Critical Sequence Training (SCST) by adding entropy regularisation.
|
15 |
+
This helps maintain a higher entropy in the token distribution,
|
16 |
+
preventing overfitting to common phrases and ensuring a broader exploration of the vocabulary during training,
|
17 |
+
which is essential for handling the diversity of the radiology reports in the RRG24 datasets.
|
18 |
+
We apply this to a multimodal language model with RadGraph as the reward.
|
19 |
+
|
20 |
+
Additionally, our model incorporates several other aspects.
|
21 |
+
We use token type embeddings to differentiate between findings and impression section tokens, as well as image embeddings.
|
22 |
+
To handle missing sections, we employ special tokens.
|
23 |
+
We also utilise an attention mask with non-causal masking for the image embeddings and a causal mask for the report token embeddings.
|
24 |
|
25 |
|
26 |
|