---
title: Arabic PDF OCR
emoji: π
colorFrom: green
colorTo: blue
sdk: gradio
sdk_version: 4.44.0
app_file: app.py
pinned: false
license: mit
---

# π Arabic PDF OCR

Extract text from Arabic PDFs using open-source OCR engines that support Arabic.

| Model | Quality | Notes |
|---|---|---|
| **QARI-OCR** | ★★★★★ | Built specifically for Arabic by the NAMAA community. Handles diacritics, mixed fonts, and complex layouts. |
| **EasyOCR** | ★★★★ | Deep-learning based; a good balance of speed and accuracy. |
| **PaddleOCR** | ★★★★ | Excellent for clean printed Arabic text; very fast. |
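
As an illustration of what one of these engines does under the hood, here is a minimal sketch of running EasyOCR over the pages of a PDF. It is not the code in this Space's `app.py`: the function names `ocr_pdf` and `join_pages` are hypothetical, and using `pdf2image` (which needs the poppler system package) to rasterise pages is an assumption.

```python
from typing import List


def join_pages(pages: List[List[str]]) -> str:
    """Join recognised lines with newlines and separate pages with a blank line."""
    return "\n\n".join("\n".join(lines) for lines in pages)


def ocr_pdf(pdf_path: str, dpi: int = 300, use_gpu: bool = False) -> str:
    """Rasterise each PDF page and run EasyOCR's Arabic model on it (hypothetical helper)."""
    # Imported lazily so join_pages works without the OCR stack installed.
    import numpy as np
    import easyocr
    from pdf2image import convert_from_path  # assumption: poppler is installed

    reader = easyocr.Reader(["ar"], gpu=use_gpu)
    pages = []
    for page in convert_from_path(pdf_path, dpi=dpi):
        gray = np.array(page.convert("L"))  # 2-D grayscale array
        # detail=0 returns plain strings; paragraph=True merges nearby lines
        pages.append(reader.readtext(gray, detail=0, paragraph=True))
    return join_pages(pages)
```

Calling `ocr_pdf("document.pdf")` would return the recognised text as one string. A higher `dpi` usually helps with small print or dense diacritics, at the cost of slower processing.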

> For best results, enable a GPU in Space settings (ZeroGPU or T4).
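
To confirm that the GPU is actually visible to the app, a small check like the following can help. This is a sketch that assumes the engines run on PyTorch (as EasyOCR does); `pick_device` is a hypothetical helper name, and it falls back to the CPU when PyTorch or CUDA is unavailable.

```python
def pick_device() -> str:
    """Return "cuda" when PyTorch can see a GPU, otherwise "cpu"."""
    try:
        import torch  # assumption: the OCR engine in use is PyTorch-based
    except ImportError:
        return "cpu"
    return "cuda" if torch.cuda.is_available() else "cpu"


if __name__ == "__main__":
    print(f"Running OCR on: {pick_device()}")
```

The result can be passed on to the engine, for example as `gpu=(pick_device() == "cuda")` when constructing an `easyocr.Reader`.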