Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
A-Bagdasaryan 's Collections
VLM
Image-to-image
Deep Research
Text-to-speech
Image-to-Text
Text-to-Image

Image-to-Text

updated about 1 month ago
Upvote
-

  • Build error
    Agents
    432

    BLIP

    🦀
    432


  • microsoft/Florence-2-large

    Image-Text-to-Text • 0.8B • Updated Aug 4, 2025 • 994k • 1.8k

  • Salesforce/blip2-opt-2.7b

    Image-Text-to-Text • 4B • Updated Feb 3, 2025 • 464k • 439

  • Running on Zero
    Agents
    Featured
    219

    JanusFlow 1.3B

    🏃
    219

    Huggingface space for JanusFlow-1.3B


  • Runtime error
    Agents
    Featured
    2.02k

    Chat With Janus-Pro-7B

    🌍
    2.02k

    A unified multimodal understanding and generation model.


  • Running on Zero
    Agents
    161

    Chat With Janus 1.3B

    🌍
    161

    A unified multimodal understanding and generation model.


  • Qwen/Qwen2.5-VL-72B-Instruct

    Image-Text-to-Text • 73B • Updated Jun 6, 2025 • 105k • • 609

  • Qwen/Qwen2.5-VL-7B-Instruct

    Image-Text-to-Text • 8B • Updated Apr 6, 2025 • 8.93M • • 1.51k

  • nvidia/Cosmos-Reason1-7B

    Image-Text-to-Text • Updated Dec 10, 2025 • 73.7k • 240
Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs