InternVL3.5: Advancing Open-Source Multimodal Models in Versatility, Reasoning, and Efficiency Paper • 2508.18265 • Published Aug 25, 2025 • 224
hfl/Qwen2.5-VL-3B-Instruct-GPTQ-Int4 Image-Text-to-Text • 4B • Updated Mar 20, 2025 • 659 • 3
google/siglip2-so400m-patch14-384 Zero-Shot Image Classification • 1B • Updated Feb 21, 2025 • 676k • 86
Running on CPU Upgrade Agents 1.02k Open VLM Leaderboard 🌎 1.02k VLMEvalKit Evaluation Results Collection