Vision Models - a DinithKumudika Collection

DinithKumudika 's Collections

datasets (OCR/document)

object detection

Vision Models

updated 19 days ago

Qwen/Qwen2.5-VL-7B-Instruct

Image-Text-to-Text • 8B • Updated Apr 6, 2025 • 10.2M • • 1.62k
docling-project/SmolDocling-256M-preview

Image-Text-to-Text • 0.3B • Updated Sep 17, 2025 • 27.9k • 1.62k
OpenGVLab/InternVL3-8B-hf

Image-Text-to-Text • 8B • Updated Apr 23, 2025 • 58.2k • 9
LiquidAI/LFM2-VL-3B

Image-Text-to-Text • 3B • Updated Mar 30 • 14.6k • 134
nvidia/LocateAnything-3B

Image-Text-to-Text • 4B • Updated Jun 12 • 1.5M • 2.72k