Update README.md
Browse files
README.md
CHANGED
|
@@ -57,9 +57,9 @@ Fragment of "Libro de literatura en lengua chinanteca de Usila, Oaxaca" Lorenzo-
|
|
| 57 |
|
| 58 |
## Performance
|
| 59 |
|
| 60 |
-
- **Word Error Rate (WER)**: 1.
|
| 61 |
-
- **Character Error Rate (CER)**:
|
| 62 |
-
- **Accuracy**:
|
| 63 |
|
| 64 |
### Model Variants
|
| 65 |
|
|
@@ -168,7 +168,7 @@ The model was trained on a comprehensive dataset covering:
|
|
| 168 |
## Citation
|
| 169 |
|
| 170 |
```bibtex
|
| 171 |
-
@model{
|
| 172 |
title={Tachiwin OCR: A Tesseract Model for Mexico's Indigenous Languages},
|
| 173 |
author={[Tachiwin]},
|
| 174 |
year={2025},
|
|
|
|
| 57 |
|
| 58 |
## Performance
|
| 59 |
|
| 60 |
+
- **Word Error Rate (WER)**: 1.515% on evaluation dataset
|
| 61 |
+
- **Character Error Rate (CER)**: 0.503 % on evaluation dataset
|
| 62 |
+
- **Accuracy**: 98.5% word-level accuracy, 99.5% char-level accuracy
|
| 63 |
|
| 64 |
### Model Variants
|
| 65 |
|
|
|
|
| 168 |
## Citation
|
| 169 |
|
| 170 |
```bibtex
|
| 171 |
+
@model{tachiwin2025,
|
| 172 |
title={Tachiwin OCR: A Tesseract Model for Mexico's Indigenous Languages},
|
| 173 |
author={[Tachiwin]},
|
| 174 |
year={2025},
|