Spaces: Hammad712/Urdu-OCR-APP

Hammad712 committed 1 day ago

Commit fb4c874 (verified) · 1 parent: a868e74

Update main.py

Files changed (1)

main.py

@@ -37,7 +37,17 @@ def extract_text_from_image(img):
             response = client.models.generate_content(
                 model="gemini-2.0-flash",
                 contents=[
-                    "Extract the text from the image. Preserve the original formatting exactly as it appears, including line breaks, spacing, and indentation. Do not write anything except the extracted content.",
-                    img,
+                   """Extract all visible text from this image and preserve the original layout and formatting as accurately as possible.
+- Maintain line breaks, indentation, and paragraph spacing.
+- Do not merge or reflow text from multiple lines into a single line.
+- Preserve bullet points, numbering, punctuation, and symbols exactly as shown.
+- Reproduce alignment (left/center/right) where possible.
+- For tabular or columnar data, preserve column spacing and structure.
+- Do not summarize or interpret the content. Just return the raw extracted text exactly as it appears in the image.
+Return only the extracted content. Do not add explanations, headers, or any additional comments.""",
+                   img,
                 ]
             )
             return response.text
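
The multi-line prompt added in this commit could also live outside the call site. The following is only a minimal sketch of that idea, not the author's code: `OCR_PROMPT` and `build_contents` are hypothetical names that do not appear in main.py, and the prompt text is copied from the added lines above. The returned list keeps the same order (prompt first, then image) that the diff passes as `contents`.

```python
import textwrap

# Hypothetical module-level constant; the wording is the prompt added in this commit.
# textwrap.dedent strips the common indentation, so the bullet lines still start at
# column 0 in the final string, as they do in the diff.
OCR_PROMPT = textwrap.dedent("""\
    Extract all visible text from this image and preserve the original layout and formatting as accurately as possible.
    - Maintain line breaks, indentation, and paragraph spacing.
    - Do not merge or reflow text from multiple lines into a single line.
    - Preserve bullet points, numbering, punctuation, and symbols exactly as shown.
    - Reproduce alignment (left/center/right) where possible.
    - For tabular or columnar data, preserve column spacing and structure.
    - Do not summarize or interpret the content. Just return the raw extracted text exactly as it appears in the image.
    Return only the extracted content. Do not add explanations, headers, or any additional comments.""")


def build_contents(img):
    """Hypothetical helper: return [prompt, image] in the order used in the diff."""
    return [OCR_PROMPT, img]
```

Under this assumption the call in `extract_text_from_image` would pass `contents=build_contents(img)`, leaving the model name and the rest of the request unchanged.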