Update README.md
Extend model card
README.md
CHANGED
|
@@ -3,6 +3,30 @@ tags:
| 3 |
- image-to-image
|
| 4 |
license: apache-2.0
|
| 5 |
---
|
| 6 |
+
# About `sbb_binarization`
|
| 7 |
|
| 8 |
+
This is a model for document image binarization. It can be used
|
| 9 |
+
to convert all pixels in a color or grayscale document image to
|
| 10 |
+
only black or white pixels. The main purpose is to improve the
|
| 11 |
+
contrast between foreground (text) and background (paper) for
|
| 12 |
+
purposes of OCR. The model is based on a `ResNet50-Unet` architecture.
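
To make the input/output contract concrete, here is a minimal sketch of what a binarized image looks like. It uses a fixed global threshold as a simple stand-in and is not how this model works: the model predicts foreground and background with a learned network. The function name `binarize`, the threshold of 128, and the sample pixel values are all illustrative assumptions.

```python
def binarize(gray, threshold=128):
    """Map each grayscale pixel (0-255) to 0 (black) or 255 (white).

    Illustrative baseline only: a fixed global threshold, not the
    learned ResNet50-Unet approach described in this model card.
    """
    return [[255 if px >= threshold else 0 for px in row] for row in gray]


# A tiny 3x4 "page": dark strokes (text) on light, unevenly lit paper.
# The pixel values are made up for this example.
page = [
    [230, 40, 225, 210],
    [220, 35, 60, 200],
    [215, 205, 50, 190],
]

binary = binarize(page)

# Render black pixels as '#' and white pixels as '.' for a quick look.
for row in binary:
    print("".join("#" if px == 0 else "." for px in row))
```

A fixed threshold like this tends to break down on stained or unevenly lit pages, which is the kind of input a learned binarization model is meant to handle better.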
|
| 13 |
+
|
| 14 |
+
# Results
|
| 15 |
+
In the *DocEng’2021 Time-quality binarization competition*
|
| 16 |
+
([paper](https://dib.cin.ufpe.br/docs/DocEng21_bin_competition_report.pdf)),
|
| 17 |
+
the model ranked among the top 8 of 63 methods 12 times in the
|
| 18 |
+
OCR-related quality scores, and won 2 tasks.
|
| 19 |
+
|
| 20 |
+
In the *ICDAR 2021 Competition on Time-Quality Document Image
|
| 21 |
+
Binarization* ([paper](https://dib.cin.ufpe.br/docs/papers/ICDAR2021-TQDIB_final_published.pdf)),
|
| 22 |
+
the model ranked among the top 20 of 61 methods 2 times,
|
| 23 |
+
and won 1 task.
|
| 24 |
+
|
| 25 |
+
For details, see [sbb_binarization](https://github.com/qurator-spk/sbb_binarization) on GitHub.
|
| 26 |
+
|
| 27 |
+
# Weights
|
| 28 |
+
We provide the weights as a TensorFlow 2 `SavedModel`.
|
| 29 |
+
|
| 30 |
+
| Model | Downloads
|
| 31 |
+
| -------------| ------------------------
|
| 32 |
+
| `2021_03_09` | [`saved_model`](https://huggingface.co/SBB/sbb_binarization/tree/main/saved_model)
|