starride-teklia committed on
Commit 02878fc · verified · 1 Parent(s): f4b711c

Update README.md

Files changed (1)
  1. README.md +12 -6
README.md CHANGED
@@ -21,18 +21,24 @@ This model performs Handwritten Text Recognition in Austrian German.
 ## Model description
 
 The model has been trained using the PyLaia library on the [NewsEye / READ OCR training dataset from Austrian Newspapers (19th C.)](https://zenodo.org/record/3387369) dataset.
+
 Training images were resized with a fixed height of 128 pixels, keeping the original aspect ratio.
 
+| split | N lines |
+| ----- | ------: |
+| train | 52,834 |
+| val | 4,667 |
+
+An external 6-gram character language model can be used to improve recognition. The language model is trained on the text from the NorHand v3 training set.
+
 ## Evaluation results
 
 The model achieves the following results:
 
-| Split | CER (%) | WER (%) | Support |
-|---------|-----------|-----------|-----------|
-| train | 1.62 | 5.63 | 38891 |
-| val | 1.82 | 7.77 | 3282 |
-
-Results can be improved by combining PyLaia with a n-gram language model.
+| set | Language model | CER (%) | WER (%) | N lines |
+|:------|:---------------|:----------:|:-------:|----------:|
+| val | no | 1.82 | 7.77 | 4,667 |
+| val | yes | 1.77 | 7.01 | 4,667 |
 
 ## How to use
 
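
For readers comparing the CER and WER columns in this change, here is a minimal sketch of how such percentages are commonly computed. It assumes plain Levenshtein distance over characters (CER) and over whitespace-separated words (WER), aggregated across all lines. The function names are hypothetical, and this is not taken from PyLaia or from the authors' evaluation script, whose tokenization and text normalization may differ.

```python
def edit_distance(ref, hyp):
    """Levenshtein distance between two sequences (strings or lists of tokens)."""
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        curr = [i]
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # substitution or match
        prev = curr
    return prev[-1]


def error_rate(refs, hyps, tokenize):
    """Total edit errors divided by total reference tokens, as a percentage."""
    errors = sum(edit_distance(tokenize(r), tokenize(h)) for r, h in zip(refs, hyps))
    total = sum(len(tokenize(r)) for r in refs)
    return 100.0 * errors / total


def cer(refs, hyps):
    # Assumption: every character, including spaces, counts as one token.
    return error_rate(refs, hyps, list)


def wer(refs, hyps):
    # Assumption: words are separated by whitespace only.
    return error_rate(refs, hyps, str.split)


if __name__ == "__main__":
    refs = ["der Kaiser"]   # made-up reference line
    hyps = ["der Kaisar"]   # made-up recognition output
    print(f"CER: {cer(refs, hyps):.2f}%  WER: {wer(refs, hyps):.2f}%")
```

Because a single wrong character makes the whole word count as an error, WER is normally well above CER on the same lines, which is a quick sanity check when reading the table.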