Update README.md
README.md
CHANGED
@@ -10,7 +10,8 @@ pipeline_tag: text-generation
---
-
style="float:left;width:200px;height:200px;object-fit:cover;border-radius:50%;margin-right:16px;" />
**CaputEmendatoris** is a projection head for [Emendator](https://huggingface.com/aimgo/Emendator) trained to identify OCR artifacts in Latin text at a character level.
@@ -19,10 +20,17 @@ The model is intended to be used on segments of **250** characters. Anything els
Examples of CaputEmendatoris Annotations:
-

-
To use CaputEmendatoris, you can load it via the Transformers library:
---
+
+<img src="https://cdn-uploads.huggingface.co/production/uploads/62cf05b026c94b143172379c/1cUXLP7zGJuWf3MPLv5_m.png"
style="float:left;width:200px;height:200px;object-fit:cover;border-radius:50%;margin-right:16px;" />
**CaputEmendatoris** is a projection head for [Emendator](https://huggingface.com/aimgo/Emendator) trained to identify OCR artifacts in Latin text at a character level.
Examples of CaputEmendatoris Annotations:
+### Light Corruption

+Orig: Antistes mihi milibus trecentis.
+OCR: Antiftes mihi milibus trecentis: " . .. .ijiscnn p inr: h
+^ ^^^^^^^^^^^^^^^^^^^^^^^^^^
+
+### Heavy Corruption

+Orig: Cognoscenda virtute circumscripta est scientia, quae ad experientiam pertinet et ad rationem.
+OCR: C0gn0fccndauirtutccircurnfcriptacftfcientia:quacadcxpcricntiarnpcrtinct&adrationcrn«
+^ ^^^^ ^ ^^ ^^^ ^ ^^^^ ^ ^^^ ^ ^ ^^ ^ ^ ^ ^^^^
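
The caret lines in the examples above mark the characters flagged as OCR artifacts. As a minimal sketch of how such a line could be produced, the snippet below turns one boolean flag per character into a caret string. The helper `render_carets` is hypothetical and not part of CaputEmendatoris, and the flags here are written by hand for illustration rather than taken from the model; how the model's output is converted to per-character flags is not specified in this README.

```python
from typing import Sequence


def render_carets(text: str, flags: Sequence[bool]) -> str:
    """Return a line with '^' under each flagged character of `text`.

    Hypothetical helper for illustration only; `flags` is assumed to hold
    one boolean per character (True = flagged as an OCR artifact).
    """
    if len(text) != len(flags):
        raise ValueError("need exactly one flag per character")
    # Trailing spaces carry no information, so they are stripped.
    return "".join("^" if flagged else " " for flagged in flags).rstrip()


# Shortened variant of the light-corruption example: 'f' stands in for a
# long s, and stray characters trail the sentence.
ocr = "Antiftes mihi milibus trecentis: h"

# Hand-written flags (an assumption, not model output): the 'f' at index 4
# and everything after the last word of the sentence.
flags = [i == 4 or i >= 31 for i in range(len(ocr))]

print(ocr)
print(render_carets(ocr, flags))
```

Printing the text and the caret line in a monospaced font puts each caret directly under the character it refers to.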
To use CaputEmendatoris, you can load it via the Transformers library: