HuggingFaceTB/SmolVLM-256M-Instruct Image-Text-to-Text โข 0.3B โข Updated Apr 8, 2025 โข 325k โข 345
Qwen/Qwen2.5-VL-7B-Instruct Image-Text-to-Text โข 8B โข Updated Apr 6, 2025 โข 4.78M โข โข 1.47k
Runtime error Featured 2.02k Chat With Janus-Pro-7B ๐ 2.02k A unified multimodal understanding and generation model.