Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing
    • Website
      • Tasks
      • HuggingChat
      • Collections
      • Languages
      • Organizations
    • Community
      • Blog
      • Posts
      • Daily Papers
      • Learn
      • Discord
      • Forum
      • GitHub
    • Solutions
      • Team & Enterprise
      • Hugging Face PRO
      • Enterprise Support
      • Inference Providers
      • Inference Endpoints
      • Storage Buckets

  • Log In
  • Sign Up
valdanito 's Collections
ocr
asr
rmbg
tts
data structuring
llm
vlm
retrieval
medical

vlm

updated Jan 20
Upvote
-

  • Qwen/Qwen2.5-VL-7B-Instruct-AWQ

    Image-Text-to-Text • 8B • Updated Apr 6, 2025 • 716k • 103

  • Qwen/Qwen2.5-VL-3B-Instruct-AWQ

    Image-Text-to-Text • 4B • Updated Apr 6, 2025 • 88.3k • 63

  • Qwen/Qwen2.5-Omni-7B-AWQ

    Any-to-Any • 11B • Updated May 15, 2025 • 113k • 18

  • AIDC-AI/Ovis2-2B-GPTQ-Int4

    Image-Text-to-Text • 2B • Updated Mar 25, 2025 • 10 • 2

  • nvidia/Eagle2-9B

    Image-Text-to-Text • 9B • Updated Jan 28, 2025 • 457 • 63

  • Ertugrul/Qwen2.5-VL-7B-Captioner-Relaxed

    Image-Text-to-Text • 8B • Updated Mar 22, 2025 • 636 • 29

  • OpenGVLab/InternVL3-2B-AWQ

    Image-Text-to-Text • Updated Sep 11, 2025 • 28 • 1

  • Qwen/Qwen3-VL-30B-A3B-Instruct-FP8

    Image-Text-to-Text • Updated Nov 26, 2025 • 314k • 109

  • Qwen/Qwen3-VL-30B-A3B-Thinking-FP8

    Image-Text-to-Text • 31B • Updated Nov 26, 2025 • 14.3k • 54

  • Qwen/Qwen3-VL-32B-Instruct-FP8

    Image-Text-to-Text • Updated Oct 22, 2025 • 864k • 45

  • Qwen/Qwen3-VL-2B-Instruct-FP8

    Image-Text-to-Text • 2B • Updated Oct 20, 2025 • 315k • 39

  • Qwen/Qwen3-VL-4B-Instruct-FP8

    Image-Text-to-Text • 5B • Updated Oct 15, 2025 • 136k • 59
Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs