Commit 7302c8f
Parent(s): ceaeef3
update: system prompt in FigureAnnotatorFromPageImage
medrag_multi_modal/assistant/figure_annotation.py CHANGED
@@ -20,22 +20,24 @@ class FigureAnnotatorFromPageImage(weave.Model):
 annotation = self.llm_client.predict(
 system_prompt="""
 You are an expert in the domain of scientific textbooks, especially medical texts.
-You are presented with a page from a scientific textbook.
-You are to first identify the
-Then you are to identify the figure IDs associated with each figure in the image.
-Then, you are to extract the exact figure descriptions from the image.
-You need to output the figure IDs and descriptions in a structured manner as a JSON object.
+You are presented with a page from a scientific textbook from the domain of biology, specifically anatomy.
+You are to first identify all the figures in the page image, which could be images or biological diagrams, charts, graphs, etc.
+Then you are to identify the figure IDs associated with each figure in the page image.
+Then, you are to extract only the exact figure descriptions from the page image.
+You need to output the figure IDs and figure descriptions only, in a structured manner as a JSON object.
 
 Here are some clues you need to follow:
-1. Figure IDs are unique identifiers for each figure in the image.
+1. Figure IDs are unique identifiers for each figure in the page image.
 2. Sometimes figure IDs can also be found as captions to the immediate left, right, top, or bottom of the figure.
 3. Figure IDs are in the form "Fig X.Y" where X and Y are integers. For example, 1.1, 1.2, 1.3, etc.
 4. Figure descriptions are contained as captions under the figures in the image, just after the figure ID.
-5. The text in the image is written in English and is present in a two-column format.
-6. There is a clear distinction between the figure caption and the regular text in the image in the form of extra white space.
-
+5. The text in the page image is written in English and is present in a two-column format.
+6. There is a clear distinction between the figure caption and the regular text in the page image in the form of extra white space.
+You are to carefully identify all the figures in the page image.
+7. There might be multiple figures or even no figures present in the page image. Sometimes the figures can be present side-by-side
+or one above the other.
 8. The figures may or may not have a distinct border against a white background.
-
+10. You are not supposed to alter the figure description in any way present in the page image and you are to extract it as is.
 """,
 user_prompt=[page_image],
 )
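The updated prompt asks for figure IDs and descriptions as a JSON object, with IDs in the form "Fig X.Y", and allows for pages with no figures. A minimal sketch of how a caller might check such a response follows. The diff does not show the JSON schema, so the `annotations`, `figure_id`, and `figure_description` keys, the helper name `parse_figure_annotations`, and the tolerance for an optional period after "Fig" are all assumptions made for illustration, not part of the commit.

```python
import json
import re

# Assumed pattern for the "Fig X.Y" form named in the prompt. Accepting an
# optional period after "Fig" is an assumption, not something the prompt states.
FIGURE_ID_PATTERN = re.compile(r"^Fig\.?\s*(\d+)\.(\d+)$")


def parse_figure_annotations(raw_response: str) -> list[dict]:
    """Parse a hypothetical JSON response and keep only well-formed entries.

    The key names used here are illustrative; the real schema is not shown
    in the diff.
    """
    payload = json.loads(raw_response)
    valid = []
    # A page may contain no figures, so a missing or empty list is acceptable.
    for entry in payload.get("annotations", []):
        figure_id = str(entry.get("figure_id", "")).strip()
        description = str(entry.get("figure_description", "")).strip()
        # Keep the description verbatim (apart from surrounding whitespace),
        # matching the prompt's instruction not to alter it.
        if FIGURE_ID_PATTERN.match(figure_id) and description:
            valid.append(
                {"figure_id": figure_id, "figure_description": description}
            )
    return valid
```

Entries whose ID does not match the assumed pattern are dropped rather than repaired, so malformed output stays visible to the caller as a shorter list.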