Update README.md
README.md
CHANGED
@@ -22,7 +22,9 @@ pipeline_tag: image-text-to-text
 
 ## Model Description
 
-QARI-OCR v0.3 is a specialized vision-language model fine-tuned for Arabic Optical Character Recognition with a focus on **structural document understanding**.
+- QARI-OCR v0.3 is a specialized vision-language model fine-tuned for Arabic Optical Character Recognition with a focus on **structural document understanding**.
+- Built on Qwen2-VL-2B-Instruct, this model excels at preserving document layouts, HTML tags, and formatting while transcribing Arabic text.
+- It is described in detail in the paper [QARI-OCR: High-Fidelity Arabic Text Recognition through Multimodal Large Language Model Adaptation](https://huggingface.co/papers/2506.02295).
 
 ### Key Features
 