LightOnOCR: A 1B End-to-End Multilingual Vision-Language Model for State-of-the-Art OCR Paper • 2601.14251 • Published 6 days ago • 23
LightOnOCR-2 🦉 Collection LightOnOCR-2-1B: a lightweight high-performance end-to-end OCR model family • 12 items • Updated 5 days ago • 17
view article Article LightOnOCR-2-1B: a lightweight high-performance end-to-end OCR model family 7 days ago • 60
view article Article LightOnOCR-1B: The Case for End-to-End and Efficient Domain-Specific Vision-Language Models for OCR Oct 23, 2025 • 70
view article Article Introducing EuroBERT: A High-Performance Multilingual Encoder Model Mar 10, 2025 • 146
ModernBERT Collection Bringing BERT into modernity via both architecture changes and scaling • 3 items • Updated Dec 19, 2024 • 156
view article Article ArabicWeb24: Creating a High Quality Arabic Web-only Pre-training Dataset Aug 8, 2024 • 11