prithivMLmods commited on
Commit
2112365
Β·
verified Β·
1 Parent(s): 74afa98

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +14 -14
README.md CHANGED
@@ -119,22 +119,22 @@ print(output_text)
119
 
120
  **LatexMind-2B-Codec** is designed for tasks that require **image-based text recognition**, **math equation extraction**, and **multi-modal understanding**. It is particularly useful in the following scenarios:
121
 
122
- πŸ”Ή **Optical Character Recognition (OCR)** – Extracting printed and handwritten text from images, documents, and scanned pages.
123
- πŸ”Ή **Math Expression Recognition** – Converting mathematical notations into structured **LaTeX format** for further computation and documentation.
124
- πŸ”Ή **Image-to-Text Conversion** – Generating accurate descriptions for text-rich and math-heavy images.
125
- πŸ”Ή **Document and Academic Processing** – Assisting researchers, students, and professionals in digitizing handwritten notes and extracting structured content from books, PDFs, and whiteboards.
126
- πŸ”Ή **Automated Educational Support** – Enabling AI-powered tutors, content summarization, and interactive learning for subjects involving complex equations.
127
- πŸ”Ή **Multi-Language OCR** – Recognizing text inside images across multiple languages, including English, Chinese, Japanese, Korean, Arabic, and various European languages.
128
- πŸ”Ή **Video-Based Question Answering** – Understanding long-duration videos for content summarization, question answering, and structured data extraction.
129
 
130
  # Limitations
131
 
132
  Despite its capabilities, **LatexMind-2B-Codec** has some inherent limitations:
133
 
134
- ⚠ **Handwritten Text Accuracy** – While it can recognize handwritten equations, performance may degrade with highly unstructured or messy handwriting.
135
- ⚠ **Complex LaTeX Formatting** – The model may struggle with deeply nested or ambiguous LaTeX expressions, requiring manual corrections for precise formatting.
136
- ⚠ **Low-Resolution Images** – Extracting accurate text from blurry or low-resolution images can lead to misinterpretations or OCR errors.
137
- ⚠ **Contextual Understanding in Multi-Step Equations** – While it recognizes math expressions, solving multi-step problems autonomously may be limited.
138
- ⚠ **Limited Support for Rare Mathematical Notations** – Some specialized or domain-specific symbols may not be recognized with high accuracy.
139
- ⚠ **Processing Speed for Large Documents** – Performance may slow down when handling extremely large documents or dense mathematical content in real-time applications.
140
- ⚠ **Language-Specific OCR Variability** – While it supports multiple languages, OCR accuracy may vary depending on the script complexity and font style.
 
119
 
120
  **LatexMind-2B-Codec** is designed for tasks that require **image-based text recognition**, **math equation extraction**, and **multi-modal understanding**. It is particularly useful in the following scenarios:
121
 
122
+ **Optical Character Recognition (OCR)** – Extracting printed and handwritten text from images, documents, and scanned pages.
123
+ **Math Expression Recognition** – Converting mathematical notations into structured **LaTeX format** for further computation and documentation.
124
+ **Image-to-Text Conversion** – Generating accurate descriptions for text-rich and math-heavy images.
125
+ **Document and Academic Processing** – Assisting researchers, students, and professionals in digitizing handwritten notes and extracting structured content from books, PDFs, and whiteboards.
126
+ **Automated Educational Support** – Enabling AI-powered tutors, content summarization, and interactive learning for subjects involving complex equations.
127
+ **Multi-Language OCR** – Recognizing text inside images across multiple languages, including English, Chinese, Japanese, Korean, Arabic, and various European languages.
128
+ **Video-Based Question Answering** – Understanding long-duration videos for content summarization, question answering, and structured data extraction.
129
 
130
  # Limitations
131
 
132
  Despite its capabilities, **LatexMind-2B-Codec** has some inherent limitations:
133
 
134
+ **Handwritten Text Accuracy** – While it can recognize handwritten equations, performance may degrade with highly unstructured or messy handwriting.
135
+ **Complex LaTeX Formatting** – The model may struggle with deeply nested or ambiguous LaTeX expressions, requiring manual corrections for precise formatting.
136
+ **Low-Resolution Images** – Extracting accurate text from blurry or low-resolution images can lead to misinterpretations or OCR errors.
137
+ **Contextual Understanding in Multi-Step Equations** – While it recognizes math expressions, solving multi-step problems autonomously may be limited.
138
+ **Limited Support for Rare Mathematical Notations** – Some specialized or domain-specific symbols may not be recognized with high accuracy.
139
+ **Processing Speed for Large Documents** – Performance may slow down when handling extremely large documents or dense mathematical content in real-time applications.
140
+ **Language-Specific OCR Variability** – While it supports multiple languages, OCR accuracy may vary depending on the script complexity and font style.