Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
d0zz0d 's Collections
Sub-5b Things
Sub-2b Things
Below Double-Digits - T2T LLMs
Below Double-Digits - IT2T VLMs
Sub-5b Parameter T2T LLMs
Sub-5b Parameter MM2T LLMs
Sub-5b T2T LLMs - Finetunes
Sub-2b Parameter T2T LLMs
Sub-2b Parameter IT2T VLMs
Sub-2b T2T LLMs - Finetunes
Sub-2b T2T LLMs - Historical
Sub-1b Parameter T2T LLMs
Sub-1b Parameter IT2T VLMs
Sub-1b T2T LLMs - Finetunes
Sub-1b T2T LLMs - Historical

Sub-2b Parameter IT2T VLMs

updated 28 days ago

A list of small Image-Text-to-Text Vision Language models, within the sub-2b parameter range.

Upvote
-

  • OpenGVLab/InternVL3_5-1B

    Image-Text-to-Text • 1B • Updated Aug 29 • 47.6k • 20

  • LiquidAI/LFM2-VL-1.6B

    Image-Text-to-Text • 2B • Updated 23 days ago • 1.84k • 216

  • opendatalab/MinerU2.5-2509-1.2B

    Image-Text-to-Text • 1B • Updated Sep 29 • 1.06M • 303

  • Qwen/Qwen3-VL-2B-Instruct

    Image-Text-to-Text • 2B • Updated Oct 23 • 1.23M • 244

  • Qwen/Qwen3-VL-2B-Thinking

    Image-Text-to-Text • 2B • Updated Oct 20 • 38.1k • 94
Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs