HuggingFaceTB/SmolVLM-256M-Instruct Image-Text-to-Text โข 0.3B โข Updated Apr 8, 2025 โข 772k โข 358
Qwen/Qwen2.5-VL-7B-Instruct Image-Text-to-Text โข 8B โข Updated Apr 6, 2025 โข 8.73M โข โข 1.53k
Runtime error Agents Featured 2.02k Chat With Janus-Pro-7B ๐ 2.02k A unified multimodal understanding and generation model.
Running on CPU Upgrade Agents 1.01k Open VLM Leaderboard ๐ 1.01k VLMEvalKit Evaluation Results Collection