Update README.md
Browse files
README.md
CHANGED
|
@@ -11,7 +11,15 @@ widget:
|
|
| 11 |
# Persian-OCR
|
| 12 |
|
| 13 |
**Persian-OCR** is a deep learning model for **Optical Character Recognition (OCR)**, designed specifically for Persian text.
|
| 14 |
-
The model
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 15 |
|
| 16 |
## Files
|
| 17 |
|
|
|
|
| 11 |
# Persian-OCR
|
| 12 |
|
| 13 |
**Persian-OCR** is a deep learning model for **Optical Character Recognition (OCR)**, designed specifically for Persian text.
|
| 14 |
+
The model employs a **CNN + Transformer architecture** trained with **CTC loss** to extract text from images.
|
| 15 |
+
|
| 16 |
+
The model was trained on a custom dataset of approximately **600,000 synthetic Persian text images**.
|
| 17 |
+
These images were generated from **Wikipedia text** using **49 different Persian fonts**, with sequence lengths ranging from **0 to 150 characters**.
|
| 18 |
+
|
| 19 |
+
On this dataset, the model achieves a **sequence accuracy of 96%**.
|
| 20 |
+
|
| 21 |
+
The model may benefit from **further fine-tuning on real-world data**, and contributions or collaborations are **warmly welcomed**.
|
| 22 |
+
|
| 23 |
|
| 24 |
## Files
|
| 25 |
|