InternVL3.5: Advancing Open-Source Multimodal Models in Versatility, Reasoning, and Efficiency Paper • 2508.18265 • Published Aug 25, 2025 • 224
meituan-longcat/LongCat-Flash-Thinking Text Generation • 562B • Updated Sep 24, 2025 • 71 • 149
ibm-granite/granite-docling-258M Image-Text-to-Text • 0.3B • Updated Sep 23, 2025 • 103k • 1.2k
PaddlePaddle/PaddleOCR-VL Image-Text-to-Text • 1.0B • Updated about 18 hours ago • 7.4k • 1.63k