Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing
    • Website
      • Tasks
      • HuggingChat
      • Collections
      • Languages
      • Organizations
    • Community
      • Blog
      • Posts
      • Daily Papers
      • Learn
      • Discord
      • Forum
      • GitHub
    • Solutions
      • Team & Enterprise
      • Hugging Face PRO
      • Enterprise Support
      • Inference Providers
      • Inference Endpoints
      • Storage Buckets

  • Log In
  • Sign Up
talentestors 's Collections
qwen
embedding
VLM
TTS
Application
Generate-Image
DataSet
Generate-3D
LLM

VLM

updated 30 days ago
Upvote
-

  • deepseek-ai/DeepSeek-OCR

    Image-Text-to-Text • 3B • Updated Nov 4, 2025 • 2.99M • 3.24k

  • PaddlePaddle/PaddleOCR-VL

    Image-Text-to-Text • 1.0B • Updated 20 days ago • 11.4k • 1.6k

  • deepseek-ai/deepseek-vl2

    Image-Text-to-Text • Updated Dec 18, 2024 • 6.39k • 381

  • Qwen/Qwen3-VL-235B-A22B-Thinking

    Image-Text-to-Text • 236B • Updated Nov 26, 2025 • 12.2k • • 395

  • Qwen/Qwen3-VL-8B-Instruct

    Image-Text-to-Text • 9B • Updated Oct 15, 2025 • 6.31M • • 907

  • Qwen/Qwen3-VL-235B-A22B-Instruct

    Image-Text-to-Text • 236B • Updated Nov 26, 2025 • 1.6M • • 388

  • deepseek-ai/DeepSeek-OCR-2

    Image-Text-to-Text • 3B • Updated Feb 3 • 1.6M • 954

  • zai-org/GLM-OCR

    Image-Text-to-Text • 1B • Updated about 18 hours ago • 7.05M • • 1.75k

  • moonshotai/Kimi-K2.6

    Image-Text-to-Text • 1.1T • Updated about 22 hours ago • 2.48M • • 1.31k
Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs