Hugging Face
oceansweep's Collections

VLMs

updated Jun 23, 2024

  • openbmb/MiniCPM-V-2

    Visual Question Answering • 3B parameters • Updated Jan 15, 2025 • 14k downloads • 499 likes

  • HuggingFaceM4/idefics2-8b-base

    Image-Text-to-Text • 8B parameters • Updated Jul 30, 2024 • 2k downloads • 28 likes

  • HuggingFaceM4/idefics2-8b

    Image-Text-to-Text • 8B parameters • Updated Oct 14, 2024 • 134k downloads • 624 likes

  • Florence-2: Advancing a Unified Representation for a Variety of Vision Tasks

    Paper • arXiv 2311.06242 • Published Nov 10, 2023 • 97 upvotes

  • microsoft/Florence-2-large-ft

    Image-Text-to-Text • 0.8B parameters • Updated Aug 4, 2025 • 24.3k downloads • 386 likes

  • microsoft/kosmos-2.5

    Image-Text-to-Text • 1B parameters • Updated Aug 28, 2025 • 124k downloads • 271 likes