starride-teklia
committed on
Update README.md
README.md
CHANGED
@@ -21,18 +21,24 @@ This model performs Handwritten Text Recognition in Austrian German.
 ## Model description
 
 The model has been trained using the PyLaia library on the [NewsEye / READ OCR training dataset from Austrian Newspapers (19th C.)](https://zenodo.org/record/3387369) dataset.
+
 Training images were resized with a fixed height of 128 pixels, keeping the original aspect ratio.
 
+| split | N lines |
+| ----- | ------: |
+| train | 52,834 |
+| val | 4,667 |
+
+An external 6-gram character language model can be used to improve recognition. The language model is trained on the text from the NorHand v3 training set.
+
 ## Evaluation results
 
 The model achieves the following results:
 
-
-
-
-| val
-
-Results can be improved by combining PyLaia with a n-gram language model.
+| set | Language model | CER (%) | WER (%) | N lines |
+|:------|:---------------|:----------:|:-------:|----------:|
+| val | no | 1.82 | 1.77 | 4,667 |
+| val | yes | 7.77 | 7.01 | 4,667 |
 
 ## How to use
 
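The README states that training images were resized to a fixed height of 128 pixels while keeping the original aspect ratio. As a minimal sketch of that step, the hypothetical helper below computes the target size for a line image. The function name, the rounding rule, and the one-pixel minimum width are assumptions made for illustration; the README does not say how the actual preprocessing rounds or interpolates.

```python
from typing import Tuple


def resized_dimensions(width: int, height: int, target_height: int = 128) -> Tuple[int, int]:
    """Return (new_width, new_height) for a fixed-height, aspect-preserving resize.

    Hypothetical helper: rounding to the nearest pixel and the 1-pixel
    minimum width are assumptions, not documented behavior.
    """
    if width <= 0 or height <= 0:
        raise ValueError("width and height must be positive")
    # Scale both dimensions by the same factor so the aspect ratio is kept.
    scale = target_height / height
    new_width = max(1, round(width * scale))
    return new_width, target_height


if __name__ == "__main__":
    # A 1000x64 line image is scaled up by a factor of 2.
    print(resized_dimensions(1000, 64))
```

With Pillow, the result could be passed straight to `Image.resize`, for example `image.resize(resized_dimensions(*image.size))`, since `Image.size` is a `(width, height)` tuple.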