handwriting-ocr / README.md
Raymond Weitekamp
fix version
d0c43d9
---
title: Handwriting OCR Data Collection
emoji: ✍️
colorFrom: blue
colorTo: indigo
sdk: gradio
sdk_version: 5.15.0
app_file: app.py
pinned: false
short_description: Collect handwritten text samples for OCR training
tags:
- ocr
- handwriting
- dataset
- computer-vision
hf_oauth: true
hf_oauth_expiration_minutes: 480
hf_oauth_scopes:
- read-repos
- write-repos
- manage-repos
- inference-api
---
# Handwriting OCR Dataset Collection
This Space provides an interface for collecting handwritten samples of text to create a dataset for OCR (Optical Character Recognition) training. Users are presented with text snippets which they can handwrite and upload as images.
## How it Works
1. You will be shown 1-5 consecutive sentences about OCR and handwriting recognition
2. Write these sentences by hand on paper
3. Take a photo or scan of your handwriting
4. Upload the image through the interface
5. Submit or skip to get a new text block
The collected data pairs (text and corresponding handwritten images) will be used to train and improve handwriting recognition models.
## Usage
Simply visit the Space and follow the on-screen instructions to contribute your handwriting samples to the dataset.