Updated README.md
Browse files
README.md
CHANGED
|
@@ -15,7 +15,9 @@ should probably proofread and complete it, then remove this comment. -->
|
|
| 15 |
|
| 16 |
# binarization-segformer-b3
|
| 17 |
|
| 18 |
-
This model is a fine-tuned version of [nvidia/segformer-b3](https://huggingface.co/nvidia/segformer-b3-finetuned-cityscapes-1024-1024)
|
|
|
|
|
|
|
| 19 |
|
| 20 |
It achieves the following results on the evaluation set on DIBCO metrics:
|
| 21 |
- loss: 0.1017
|
|
@@ -24,18 +26,18 @@ It achieves the following results on the evaluation set on DIBCO metrics:
|
|
| 24 |
- PSNR: 14.5040
|
| 25 |
- DRD: 5.3749
|
| 26 |
|
| 27 |
-
|
| 28 |
|
| 29 |
For more information on the above DIBCO metrics, see the 2017 introductory [paper](https://ieeexplore.ieee.org/document/8270159).
|
| 30 |
|
| 31 |
-
**Warning:** This model only accepts images with a resolution of 640 due to compute constraints on Colab free tier during training.
|
| 32 |
|
| 33 |
## Model description
|
| 34 |
|
| 35 |
This model is part of on-going research on pure semantic segmentation models as a formulation of document image binarization (DIBCO).
|
| 36 |
This is in contrast to the late trend of adapting classic binarization algorithms with neural networks,
|
| 37 |
such as [DeepOtsu](https://arxiv.org/abs/1901.06081) or the aforementioned SauvolaNet work,
|
| 38 |
-
as extensions of the classical Otsu's method and Sauvola thresholding, respectively.
|
| 39 |
|
| 40 |
## Intended uses & limitations
|
| 41 |
|
|
|
|
| 15 |
|
| 16 |
# binarization-segformer-b3
|
| 17 |
|
| 18 |
+
This model is a fine-tuned version of [nvidia/segformer-b3](https://huggingface.co/nvidia/segformer-b3-finetuned-cityscapes-1024-1024)
|
| 19 |
+
on the same ensemble of 13 datasets as the [SauvolaNet](https://arxiv.org/pdf/2105.05521.pdf) work publicly available
|
| 20 |
+
in their GitHub [repository](https://github.com/Leedeng/SauvolaNet#datasets).
|
| 21 |
|
| 22 |
It achieves the following results on the evaluation set on DIBCO metrics:
|
| 23 |
- loss: 0.1017
|
|
|
|
| 26 |
- PSNR: 14.5040
|
| 27 |
- DRD: 5.3749
|
| 28 |
|
| 29 |
+
with PSNR the peak signal-to-noise ratio and DND the distance reciprocal distortion.
|
| 30 |
|
| 31 |
For more information on the above DIBCO metrics, see the 2017 introductory [paper](https://ieeexplore.ieee.org/document/8270159).
|
| 32 |
|
| 33 |
+
**Warning:** This model only accepts images with a resolution of 640 due to GPU compute constraints on Colab free tier during training.
|
| 34 |
|
| 35 |
## Model description
|
| 36 |
|
| 37 |
This model is part of on-going research on pure semantic segmentation models as a formulation of document image binarization (DIBCO).
|
| 38 |
This is in contrast to the late trend of adapting classic binarization algorithms with neural networks,
|
| 39 |
such as [DeepOtsu](https://arxiv.org/abs/1901.06081) or the aforementioned SauvolaNet work,
|
| 40 |
+
as extensions of the classical Otsu's method and Sauvola thresholding algorithm, respectively.
|
| 41 |
|
| 42 |
## Intended uses & limitations
|
| 43 |
|