Optimized Table Tokenization for Table Structure Recognition Paper • 2305.03393 • Published May 5, 2023 • 1
MolGrapher: Graph-based Visual Recognition of Chemical Structures Paper • 2308.12234 • Published Aug 23, 2023
DocLayNet: A Large Human-Annotated Dataset for Document-Layout Analysis Paper • 2206.01062 • Published Jun 2, 2022 • 3
KVP10k : A Comprehensive Dataset for Key-Value Pair Extraction in Business Documents Paper • 2405.00505 • Published May 1, 2024
TableFormer: Table Structure Understanding with Transformers Paper • 2203.01017 • Published Mar 2, 2022
Docling: An Efficient Open-Source Toolkit for AI-driven Document Conversion Paper • 2501.17887 • Published Jan 27, 2025 • 1
Granite Vision: a lightweight, open-source multimodal model for enterprise Intelligence Paper • 2502.09927 • Published Feb 14, 2025
SmolDocling: An ultra-compact vision-language model for end-to-end multi-modal document conversion Paper • 2503.11576 • Published Mar 14, 2025 • 136
SmolDocling: An ultra-compact vision-language model for end-to-end multi-modal document conversion Paper • 2503.11576 • Published Mar 14, 2025 • 136
SmolDocling: An ultra-compact vision-language model for end-to-end multi-modal document conversion Paper • 2503.11576 • Published Mar 14, 2025 • 136
Granite Vision: a lightweight, open-source multimodal model for enterprise Intelligence Paper • 2502.09927 • Published Feb 14, 2025
GLOV: Guided Large Language Models as Implicit Optimizers for Vision Language Models Paper • 2410.06154 • Published Oct 8, 2024 • 16
Self-MoE: Towards Compositional Large Language Models with Self-Specialized Experts Paper • 2406.12034 • Published Jun 17, 2024 • 16
$\textit{Trans-LoRA}$: towards data-free Transferable Parameter Efficient Finetuning Paper • 2405.17258 • Published May 27, 2024 • 16
INDUS: Effective and Efficient Language Models for Scientific Applications Paper • 2405.10725 • Published May 17, 2024 • 34
Granite Code Models: A Family of Open Foundation Models for Code Intelligence Paper • 2405.04324 • Published May 7, 2024 • 25
LangNav: Language as a Perceptual Representation for Navigation Paper • 2310.07889 • Published Oct 11, 2023 • 6