update citation, add image, tags, and changed pipeline
README.md
CHANGED
@@ -6,6 +6,7 @@ tags:
 - unsloth
 - qwen2_vl
 - trl
+- ocr
 license: apache-2.0
 language:
 - ar
@@ -13,13 +14,20 @@ metrics:
 - bleu
 - wer
 - cer
+pipeline_tag: image-text-to-text
+library_name: transformers
 ---
 
 # Qari-OCR-0.1-VL-2B-Instruct Model
 
 ## Model Overview
+
 This model is a fine-tuned version of [unsloth/Qwen2-VL-2B-Instruct](unsloth/Qwen2-VL-2B-Instruct-unsloth-bnb-4bit) on an Arabic OCR dataset. It is optimized to perform Arabic Optical Character Recognition (OCR) for full-page text.
 
+
+
+
+
 ## Model Details
 - **Base Model**: Qwen2 VL
 - **Fine-tuning Dataset**: Arabic OCR dataset
@@ -128,4 +136,16 @@ print(output_text)
 This model follows the licensing terms of the original Qwen2 VL model. Please review the terms before using it commercially.
 
 ## Citation
-
+
+If you use this model in your research, please cite:
+
+```
+@misc{QariOCR2025,
+title={Qari-OCR: A High-Accuracy Model for Arabic Optical Character Recognition},
+author={NAMAA},
+year={2025},
+publisher={Hugging Face},
+howpublished={\url{https://huggingface.co/NAMAA-Space/Qari-OCR-0.1-VL-2B-Instruct}},
+note={Accessed: 2025-03-03}
+}
+```
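
The front matter in this diff lists `wer` and `cer` among the metrics, but the changed lines do not show how they are computed. As a rough illustration only, the sketch below uses the common edit-distance definitions (word error rate and character error rate as Levenshtein distance divided by reference length). The function names `edit_distance`, `cer`, and `wer` are hypothetical helpers written for this note; they are not taken from the model card or from any evaluation script used by the authors.

```python
def edit_distance(ref, hyp):
    """Levenshtein distance between two sequences (strings or lists of tokens)."""
    # prev[j] holds the distance between the processed prefix of ref and hyp[:j].
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        curr = [i]  # distance from ref[:i] to an empty hypothesis
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(
                prev[j] + 1,         # deletion from the reference
                curr[j - 1] + 1,     # insertion into the reference
                prev[j - 1] + cost,  # substitution, or match when cost is 0
            ))
        prev = curr
    return prev[-1]


def cer(reference, hypothesis):
    """Character error rate: character-level edits per reference character."""
    if not reference:
        raise ValueError("reference must not be empty")
    return edit_distance(reference, hypothesis) / len(reference)


def wer(reference, hypothesis):
    """Word error rate: word-level edits per reference word (whitespace split)."""
    ref_words = reference.split()
    if not ref_words:
        raise ValueError("reference must contain at least one word")
    return edit_distance(ref_words, hypothesis.split()) / len(ref_words)


if __name__ == "__main__":
    # Illustrative strings only; not drawn from any dataset in the model card.
    reference = "السلام عليكم"
    hypothesis = "السلام عليك"
    print(f"CER: {cer(reference, hypothesis):.3f}")
    print(f"WER: {wer(reference, hypothesis):.3f}")
```

Note that this sketch compares Unicode code points, so Arabic diacritics count as separate characters; an actual evaluation may normalize text differently before scoring.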