philipp-zettl's Collections

VLMs

updated Sep 24, 2025

  • baidu/ERNIE-4.5-VL-28B-A3B-PT

    Image-Text-to-Text • Updated Jan 16 • 45.2k • 93

    Note apache2.0


  • Qwen/Qwen2.5-VL-3B-Instruct

    Image-Text-to-Text • 4B • Updated Apr 6, 2025 • 21.4M • 609

    Note https://huggingface.co/Qwen/Qwen2.5-VL-3B-Instruct/blob/main/LICENSE (commercial use requires a license, available upon request)


  • Qwen/Qwen2.5-VL-7B-Instruct

    Image-Text-to-Text • Updated Apr 6, 2025 • 3.08M • 1.45k

    Note apache2.0


  • Qwen/Qwen2-VL-2B-Instruct

    Image-Text-to-Text • Updated Jan 12, 2025 • 2.48M • 487

    Note apache2.0


  • Qwen/Qwen2-VL-7B

    Image-Text-to-Text • 8B • Updated Jan 12, 2025 • 13k • 63

    Note apache2.0


  • moonshotai/Kimi-VL-A3B-Thinking-2506

    Image-Text-to-Text • 16B • Updated 21 days ago • 57.5k • 350

    Note mit


  • vikhyatk/moondream2

    Image-Text-to-Text • Updated Sep 23, 2025 • 3.34M • 1.37k

    Note apache2.0
