Qwen-AgentWorld: Language World Models for General Agents Paper • 2606.24597 • Published 13 days ago • 144
PP-OCRv6 Collection From 1.5M to 34.5M Parameters, Surpassing Billion-Scale VLMs on OCR Tasks • 19 items • Updated 20 days ago • 101
PaddleOCR-VL-1.6 Collection Expanding the Frontier of Document Parsing with Under-Optimized Region Refinement and Progressive Post-Training • 5 items • Updated May 29 • 14
PaddleOCR-VL-1.6: Expanding the Frontier of Document Parsing with Under-Optimized Region Refinement and Progressive Post-Training Paper • 2606.03264 • Published Jun 2 • 23
Open Vision, Layout & OCR Models by Loay Collection This collection hosts a series of Vision Language Models (VLMs) fine-tuned for Optical Character Recognition (OCR) and Document Processing. • 5 items • Updated Apr 14 • 1
Persian-Datasets Collection Diverse datasets for training and evaluating Persian models; members can share their own datasets or draw on existing resources • 58 items • Updated Mar 2 • 11
LightOnOCR 🦉 Collection The Case for End-to-End and Efficient Domain-Specific Vision-Language Models for OCR • 6 items • Updated 11 days ago • 16
PaddleOCR-VL Collection Boosting Multilingual Document Parsing via a 0.9B Ultra-Compact Vision-Language Model • 5 items • Updated Feb 11 • 32