chore: switch to Qwen2.5-VL-3B-Instruct for OCR
- Upgrade from Qwen2-VL-2B to Qwen2.5-VL-3B
- Improved model with better performance
🤖 Generated with [Claude Code](https://claude.com/claude-code)
Co-Authored-By: Claude <noreply@anthropic.com>
app.py
CHANGED
@@ -10,8 +10,8 @@ from PIL import Image
 from transformers import Qwen2VLForConditionalGeneration, AutoProcessor
 from qwen_vl_utils import process_vision_info
 
-# Qwen2-VL model ID
-MODEL_ID = "Qwen/Qwen2-VL-
+# Qwen2.5-VL model ID
+MODEL_ID = "Qwen/Qwen2.5-VL-3B-Instruct"
 
 
 def _extract_assistant_content(decoded: str) -> str:
@@ -298,7 +298,7 @@ with gr.Blocks(theme=gr.themes.Soft(), css=CUSTOM_CSS) as demo:
 ---
 
 **ℹ️ OCR model**
-- Qwen2-VL-
+- Qwen2.5-VL-3B-Instruct - OCR based on a state-of-the-art vision-language model (GPT-4o level)
 - Multilingual support, including Korean and English
 """)
 
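
Note on the loader class: this commit changes only MODEL_ID, while the import on line 10 stays Qwen2VLForConditionalGeneration. In recent transformers releases, Qwen2.5-VL checkpoints are exposed through a separate class, Qwen2_5_VLForConditionalGeneration, so the loader probably needs to follow the model ID. The sketch below shows one way to pick the class from the ID. The names MODEL_CLASS_BY_PREFIX, model_class_name and load_model are hypothetical and do not appear in app.py, and the from_pretrained arguments are assumptions taken from common usage (device_map="auto" also requires the accelerate package).

```python
# Hypothetical helper, not part of the original app.py.
# Maps a model-ID prefix to the name of the transformers class that loads it.
MODEL_CLASS_BY_PREFIX = {
    "Qwen/Qwen2.5-VL-": "Qwen2_5_VLForConditionalGeneration",
    "Qwen/Qwen2-VL-": "Qwen2VLForConditionalGeneration",
}


def model_class_name(model_id: str) -> str:
    """Return the transformers class name expected for a Qwen VL model ID."""
    for prefix, class_name in MODEL_CLASS_BY_PREFIX.items():
        if model_id.startswith(prefix):
            return class_name
    raise ValueError(f"Unsupported model ID: {model_id!r}")


def load_model(model_id: str):
    """Load the checkpoint with the class that matches its family.

    transformers is imported lazily so the mapping above can be checked
    without the library installed. The keyword arguments are assumptions.
    """
    import transformers

    model_cls = getattr(transformers, model_class_name(model_id))
    return model_cls.from_pretrained(model_id, torch_dtype="auto", device_map="auto")
```

With this in place, load_model(MODEL_ID) would resolve "Qwen/Qwen2.5-VL-3B-Instruct" to Qwen2_5_VLForConditionalGeneration. If the installed transformers version predates Qwen2.5-VL support, the getattr call fails with an AttributeError, which makes the version requirement visible at startup.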