seraph9999's Collections

VLM

updated Feb 3, 2025

  • LargeWorldModel/LWM-Chat-1M-Jax

    Updated Feb 12, 2024 • 125

  • google/paligemma-3b-pt-896

    Image-Text-to-Text • 3B • Updated Jun 22, 2025 • 199 • 123

  • zai-org/glm-4v-9b

    14B • Updated Mar 3, 2025 • 32.2k • 266

  • facebook/sam2-hiera-large-hf

    0.2B • Updated Aug 8, 2024 • 45 • 9

  • microsoft/Florence-2-large

    Image-Text-to-Text • 0.8B • Updated Aug 4, 2025 • 713k • 1.74k

  • meta-llama/Llama-3.2-11B-Vision-Instruct

    Image-Text-to-Text • 11B • Updated Dec 4, 2024 • 160k • 1.56k

  • deepseek-ai/Janus-Pro-7B

    Any-to-Any • Updated Feb 1, 2025 • 23.7k • 3.55k