EpsilonGreedy's picture
Update README.md
c38b422 verified
---
license: afl-3.0
datasets:
- fhswf/german_handwriting
language:
- de
metrics:
- cer
base_model:
- microsoft/trocr-small-handwritten
---
# EpsilonGreedy/TrOCR_small_german_handwritten
This is a small 250mio german handwriting ocr model. <br>
It works best on images of coherent lines or phrases of text. <br>
Does not work on multiline images. <br>
You can find training and eval metrics in the metrics tab <br> <br>
Only evaluated: CER of around 4.4% <br>
You can find the big brother over here (https://huggingface.co/fhswf/TrOCR_german_handwritten)
which has CER 4.1% and WER of 17.5%. So WER will be analogous.
Thanks to Fachhochschule Südwestfalen
(https://huggingface.co/fhswf) for the great dataset (https://huggingface.co/datasets/fhswf/german_handwriting).
Further thanks to:<br>
Microsoft for the base model.<br>
Google for letting me train on kaggle for free! (https://www.kaggle.com/code/treeinsight/trocr-german-small-finetune)<br>
Huggingface for hosting this model.