Gradient-based Uncertainty Attribution for Explainable Bayesian Deep Learning Paper • 2304.04824 • Published Apr 10, 2023
Data-Prep-Kit: getting your data ready for LLM application development Paper • 2409.18164 • Published Sep 26, 2024
Granite Vision: a lightweight, open-source multimodal model for enterprise Intelligence Paper • 2502.09927 • Published Feb 14, 2025
ChartGen: Scaling Chart Understanding Via Code-Guided Synthetic Chart Generation Paper • 2507.19492 • Published May 31, 2025 • 1
Composition-Grounded Instruction Synthesis for Visual Reasoning Paper • 2510.15040 • Published Oct 16, 2025
ChartNet: A Million-Scale, High-Quality Multimodal Dataset for Robust Chart Understanding Paper • 2603.27064 • Published Mar 28 • 28
Optimized Table Tokenization for Table Structure Recognition Paper • 2305.03393 • Published May 5, 2023 • 1
MolGrapher: Graph-based Visual Recognition of Chemical Structures Paper • 2308.12234 • Published Aug 23, 2023
DocLayNet: A Large Human-Annotated Dataset for Document-Layout Analysis Paper • 2206.01062 • Published Jun 2, 2022 • 3
KVP10k: A Comprehensive Dataset for Key-Value Pair Extraction in Business Documents Paper • 2405.00505 • Published May 1, 2024
TableFormer: Table Structure Understanding with Transformers Paper • 2203.01017 • Published Mar 2, 2022 • 1
Docling: An Efficient Open-Source Toolkit for AI-driven Document Conversion Paper • 2501.17887 • Published Jan 27, 2025 • 1
SmolDocling: An ultra-compact vision-language model for end-to-end multi-modal document conversion Paper • 2503.11576 • Published Mar 14, 2025 • 158