File size: 1,217 Bytes
d4167d9
a345416
d4167d9
 
 
 
d0c43d9
d4167d9
 
 
 
 
 
 
 
a345416
 
 
 
 
 
 
 
d4167d9
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
---
title: Handwriting OCR Data Collection
emoji: ✍️
colorFrom: blue
colorTo: indigo
sdk: gradio
sdk_version: 5.15.0
app_file: app.py
pinned: false
short_description: Collect handwritten text samples for OCR training
tags:
- ocr
- handwriting
- dataset
- computer-vision

hf_oauth: true
hf_oauth_expiration_minutes: 480
hf_oauth_scopes:
  - read-repos
  - write-repos
  - manage-repos
  - inference-api
---

# Handwriting OCR Dataset Collection

This Space provides an interface for collecting handwritten samples of text to create a dataset for OCR (Optical Character Recognition) training. Users are presented with text snippets which they can handwrite and upload as images.

## How it Works

1. You will be shown 1-5 consecutive sentences about OCR and handwriting recognition
2. Write these sentences by hand on paper
3. Take a photo or scan of your handwriting
4. Upload the image through the interface
5. Submit or skip to get a new text block

The collected data pairs (text and corresponding handwritten images) will be used to train and improve handwriting recognition models.

## Usage

Simply visit the Space and follow the on-screen instructions to contribute your handwriting samples to the dataset.