MinerU

Paused

Arsenii11 commited on Feb 21

Commit

5c0022c

1 Parent(s): 8d284aa

Changed prompt for Gemini

Files changed (1) hide show

mineru_single.py CHANGED Viewed

@@ -195,10 +195,18 @@ def call_gemini_for_image_description(image_data: bytes) -> str:
             contents=[
                 {
                     "parts": [
-                        {"text": """The provided image is a part of a question paper or markscheme. Extract all the necessary information from the image to be able to identify the question.
-To identify the question, we only need the following: question number and question part. Don't include redundant information.
-For example, if image contains text like: "Q1 Part A Answer: Life on earth was created by diety..." you should return just "Q1 Part A Mark Scheme"
-If there is no text on this image, return the description of the image. 20 words max."""},
                         {
                             "inline_data": {
                                 "mime_type": "image/jpeg",

             contents=[
                 {
                     "parts": [
+                        {"text": """The provided image is a part of a question paper or markscheme.
+                                    Extract all the necessary information from the image to be able to identify the question.
+                                    To identify the question, we only need the following: question number and question part.
+                                    Don't include redundant information.
+                                    For example, if image contains text like: "Q1 Part A Answer: Life on earth was created by diety..."
+                                    you should return just "Q1 Part A Mark Scheme"
+                                    If there is no text on this image, return the description of the image. 20 words max.
+                                    If there are not enough data, consider information from the surrounding context.
+                                    Additionally, if the image contains a truncated part, you must describe it and mark as a
+                                    part of some another image that goes before or after current image.
+                        """},
                         {
                             "inline_data": {
                                 "mime_type": "image/jpeg",