DiTo97 commited on
Commit
a4c13bc
·
1 Parent(s): e4e5c3d

Updated README.md

Browse files
Files changed (1) hide show
  1. README.md +6 -4
README.md CHANGED
@@ -15,7 +15,9 @@ should probably proofread and complete it, then remove this comment. -->
15
 
16
  # binarization-segformer-b3
17
 
18
- This model is a fine-tuned version of [nvidia/segformer-b3](https://huggingface.co/nvidia/segformer-b3-finetuned-cityscapes-1024-1024) on the same ensemble of 13 datasets as the [SauvolaNet work](https://arxiv.org/pdf/2105.05521.pdf). The ensemble is publicly available in the official [SauvolaNet repository](https://github.com/Leedeng/SauvolaNet#datasets).
 
 
19
 
20
  It achieves the following results on the evaluation set on DIBCO metrics:
21
  - loss: 0.1017
@@ -24,18 +26,18 @@ It achieves the following results on the evaluation set on DIBCO metrics:
24
  - PSNR: 14.5040
25
  - DRD: 5.3749
26
 
27
- where PSNR stands for peak signal-to-noise ratio and DND for distance reciprocal distortion.
28
 
29
  For more information on the above DIBCO metrics, see the 2017 introductory [paper](https://ieeexplore.ieee.org/document/8270159).
30
 
31
- **Warning:** This model only accepts images with a resolution of 640 due to compute constraints on Colab free tier during training.
32
 
33
  ## Model description
34
 
35
  This model is part of on-going research on pure semantic segmentation models as a formulation of document image binarization (DIBCO).
36
  This is in contrast to the late trend of adapting classic binarization algorithms with neural networks,
37
  such as [DeepOtsu](https://arxiv.org/abs/1901.06081) or the aforementioned SauvolaNet work,
38
- as extensions of the classical Otsu's method and Sauvola thresholding, respectively.
39
 
40
  ## Intended uses & limitations
41
 
 
15
 
16
  # binarization-segformer-b3
17
 
18
+ This model is a fine-tuned version of [nvidia/segformer-b3](https://huggingface.co/nvidia/segformer-b3-finetuned-cityscapes-1024-1024)
19
+ on the same ensemble of 13 datasets as the [SauvolaNet](https://arxiv.org/pdf/2105.05521.pdf) work publicly available
20
+ in their GitHub [repository](https://github.com/Leedeng/SauvolaNet#datasets).
21
 
22
  It achieves the following results on the evaluation set on DIBCO metrics:
23
  - loss: 0.1017
 
26
  - PSNR: 14.5040
27
  - DRD: 5.3749
28
 
29
+ with PSNR the peak signal-to-noise ratio and DND the distance reciprocal distortion.
30
 
31
  For more information on the above DIBCO metrics, see the 2017 introductory [paper](https://ieeexplore.ieee.org/document/8270159).
32
 
33
+ **Warning:** This model only accepts images with a resolution of 640 due to GPU compute constraints on Colab free tier during training.
34
 
35
  ## Model description
36
 
37
  This model is part of on-going research on pure semantic segmentation models as a formulation of document image binarization (DIBCO).
38
  This is in contrast to the late trend of adapting classic binarization algorithms with neural networks,
39
  such as [DeepOtsu](https://arxiv.org/abs/1901.06081) or the aforementioned SauvolaNet work,
40
+ as extensions of the classical Otsu's method and Sauvola thresholding algorithm, respectively.
41
 
42
  ## Intended uses & limitations
43