shobbs's Collections

vision

updated Dec 29, 2025

  • google/paligemma2-28b-pt-896

    Image-Text-to-Text • 28B • Updated Dec 5, 2024 • 323 • 51

  • lmstudio-community/olmOCR-7B-0225-preview-GGUF

    Image-Text-to-Text • 8B • Updated Feb 25, 2025 • 205 • 12

  • vidore/colqwen2.5-v0.2

    Visual Document Retrieval • Updated Jun 16, 2025 • 15.6k • 95

  • vidore/colpali-v1.3

    Visual Document Retrieval • Updated Mar 14, 2025 • 33.3k • 88

  • vidore/colSmol-500M

    Visual Document Retrieval • Updated Mar 14, 2025 • 593 • 21

  • deepseek-ai/deepseek-vl2

    Image-Text-to-Text • 27B • Updated Dec 18, 2024 • 3.53k • 379

  • gen2seg: Generative Models Enable Generalizable Instance Segmentation

    Running on Zero • 5

    A demo of our gen2seg SD and MAE-H models.


  • nvidia/NitroGen

    Reinforcement Learning • Updated 7 days ago • 496

  • naver-hyperclovax/HyperCLOVAX-SEED-Omni-8B

    Text Generation • 11B • Updated Jan 6 • 941 • 180