Update README.md
README.md
@@ -6,7 +6,7 @@ pipeline_tag: text-generation
 <img src="https://cdn-uploads.huggingface.co/production/uploads/62cf05b026c94b143172379c/TzFC1Lo2kZ_38hAay9Wf2.png"
 style="float:left;width:200px;height:200px;object-fit:cover;border-radius:50%;margin-right:16px;" />
 
-[byt5-xl](https://huggingface.co/google/byt5-xl) finetuned to correct OCR artifacts in Latin text.
+**Emendator** is a [byt5-xl](https://huggingface.co/google/byt5-xl) model finetuned to correct OCR artifacts in Latin text.
 
 **This model cannot provide completely faithful reconstruction for all orthographies - on a large scale, it will shift the distribution of tokens towards what that which it has been trained on.**
 As such, use it only in circumstances when the primary concern is only to recover intelligble Latin, not when the concern is to recover intelligble Latin of a particular style.