afondiel's Collections
Vision

updated Oct 23, 2024

  • An-619/FastSAM

    Updated Jun 22, 2023 • 62

  • black-forest-labs/FLUX.1-dev

    Text-to-Image • Updated Jun 27, 2025 • 1.1M • 13.4k

  • black-forest-labs/FLUX.1-schnell

    Text-to-Image • Updated Aug 16, 2024 • 210k • 5.21k

  • google/owlvit-base-patch32

    Zero-Shot Object Detection • 0.2B • Updated Dec 12, 2023 • 175k • 149

  • openai/clip-vit-base-patch32

    Zero-Shot Image Classification • Updated Feb 29, 2024 • 23.2M • 963

  • llava-hf/vip-llava-7b-hf

    Image-Text-to-Text • 7B • Updated Jan 27, 2025 • 975 • 16

  • mistral-community/pixtral-12b-240910

    Image-Text-to-Text • Updated Oct 1, 2024 • 1.76k • 381

  • microsoft/Phi-3-vision-128k-instruct

    Text Generation • 4B • Updated Dec 10, 2025 • 252k • 970