Nemotron-Cascade 2 Collection Post-Training LLMs with Cascade RL and Multi-Domain On-Policy Distillation • 4 items • Updated 16 days ago • 50
OmniScience: A Large-scale Multi-modal Dataset for Scientific Image Understanding Paper • 2602.13758 • Published Feb 14 • 6
view article Article 流式数据集:效率提升 100 倍 +3 andito, lhoestq, burtenshaw, pcuenq, merve • Oct 27, 2025 • 7
view article Article LightOnOCR-2-1B: a lightweight high-performance end-to-end OCR model family lightonai • Jan 19 • 96
RxnBench: A Multimodal Benchmark for Evaluating Large Language Models on Chemical Reaction Understanding from Scientific Literature Paper • 2512.23565 • Published Dec 29, 2025 • 1
view article Article NVIDIA Releases 3 Million Sample Dataset for OCR, Visual Question Answering, and Captioning Tasks nvidia • Aug 11, 2025 • 76
SmolDocling: An ultra-compact vision-language model for end-to-end multi-modal document conversion Paper • 2503.11576 • Published Mar 14, 2025 • 164
view article Article SmolVLM2: Bringing Video Understanding to Every Device +5 orrzohar, mfarre, andito, merve, pcuenq, cyrilzakka, Xenova • Feb 20, 2025 • 343
view article Article Open-source DeepResearch – Freeing our search agents +3 m-ric, albertvillanova, merve, thomwolf, clefourrier • Feb 4, 2025 • 1.32k
view article Article Timm ❤️ Transformers: Use any timm model with transformers +3 ariG23498, rwightman, qubvel-hf, pcuenq, reach-vb • Jan 16, 2025 • 55
MolParser: End-to-end Visual Recognition of Molecule Structures in the Wild Paper • 2411.11098 • Published Nov 17, 2024 • 1