Spaces:

Omarrran
/

ocr_iitr

Sleeping

App Files Files Community

Omarrran commited on Sep 30, 2024

Commit

d78ceed

verified ·

1 Parent(s): 698c9a9

Update README.md

Browse files

Files changed (1) hide show

README.md +79 -0

README.md CHANGED Viewed

@@ -9,5 +9,84 @@ app_file: app.py
 pinned: false
 license: mit
 ---
 Check out the configuration reference at https://huggingface.co/docs/hub/spaces-config-reference

 pinned: false
 license: mit
 ---
+# OCR Image to Text Converter
+## Overview
+This project is an OCR (Optical Character Recognition) application that allows users to extract text from images. It supports multiple languages and provides a user-friendly interface for uploading images, performing OCR, and searching within the extracted text.
+## Features
+- Upload images for text extraction
+- Support for multiple languages (including Hindi, English, Thai, French, Chinese, Japanese, and Korean)
+- Visualize OCR results with bounding boxes around detected text
+- Search functionality within the extracted text
+- Example images provided for testing
+## Technologies Used
+- Python
+- Gradio (for the web interface)
+- EasyOCR (for optical character recognition)
+- PIL (Python Imaging Library)
+- PyTorch
+## Setup and Installation
+1. Clone this repository:
+   ```
+   git clone https://huggingface.co/spaces/Omarrran/ocr_iitr
+   cd ocr_iitr
+   ```
+2. Install the required dependencies:
+   ```
+   pip install pillow gradio torch easyocr
+   ```
+3. Run the application:
+   ```
+   python app.py
+   ```
+4. Open your web browser and navigate to the local URL provided by Gradio (usually `http://127.0.0.1:7860`).
+## Usage
+1. Upload an image or select one of the provided examples.
+2. Choose the language(s) for OCR processing.
+3. Click the "Perform OCR" button to extract text from the image.
+4. View the result image with bounding boxes and the extracted text.
+5. Use the search box to find specific text within the extracted content.
+## Example Images
+The application includes several example images for testing:
+- English text
+- Hindi text
+- Thai text
+- French text
+- Chinese text
+- Japanese text
+- Korean text
+These images are automatically downloaded when you run the application.
+## Alternative Version
+An alternative, ready-to-use version of this OCR application is available on Vercel. You can access it [here](https://iitr-haq-nawaz-maliks-projects.vercel.app/).
+## Notes
+- The OCR process may take a few seconds to complete, especially for larger or more complex images.
+- The accuracy of text extraction may vary depending on the quality of the input image and the complexity of the text.
+## Contributing
+Contributions to improve the application are welcome. Please feel free to submit issues or pull requests.
+## License
+license: mit
 Check out the configuration reference at https://huggingface.co/docs/hub/spaces-config-reference