File size: 1,053 Bytes
dffa2e7
6d64ed9
dffa2e7
6d64ed9
 
 
 
 
 
 
 
 
dffa2e7
6d64ed9
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
---
library_name: PyLaia
license: mit
tags:
- PyLaia
- PyTorch
- Handwritten text recognition
metrics:
- CER
- WER
language:
- ge
---

# Fraktur handwritten text recognition

This model performs Handwritten Text Recognition in Austrian German. 

## Model description

The model has been trained using the PyLaia library on the [NewsEye / READ OCR training dataset from Austrian Newspapers (19th C.)](https://zenodo.org/record/3387369) dataset.
Training images were resized with a fixed height of 128 pixels, keeping the original aspect ratio.

## Evaluation results

The model achieves the following results:

| Split   |   CER (%) |   WER (%) |   Support |
|---------|-----------|-----------|-----------|
| train   |      1.62 |      5.63 |     38891 |
| val     |      1.82 |      7.77 |      3282 |

Results can be improved by combining PyLaia with a n-gram language model. 

## How to use

Please refer to the PyLaia library page (https://pypi.org/project/pylaia/) and [wiki](https://github.com/jpuigcerver/PyLaia/wiki/inference) to use this model.