afondiel's Collections
Computer Vision Challenge
Edge-AI
Vision
Audio
Video
Autonomous Systems
Cultural AI
Language
Multimodality
Vision
Updated Oct 23, 2024
An-619/FastSAM • Updated Jun 22, 2023 • 62
black-forest-labs/FLUX.1-dev • Text-to-Image • Updated Jun 27, 2025 • 1.1M • 13.4k
black-forest-labs/FLUX.1-schnell • Text-to-Image • Updated Aug 16, 2024 • 210k • 5.21k
google/owlvit-base-patch32 • Zero-Shot Object Detection • 0.2B • Updated Dec 12, 2023 • 175k • 149
openai/clip-vit-base-patch32 • Zero-Shot Image Classification • Updated Feb 29, 2024 • 23.2M • 963
llava-hf/vip-llava-7b-hf • Image-Text-to-Text • 7B • Updated Jan 27, 2025 • 975 • 16
mistral-community/pixtral-12b-240910 • Image-Text-to-Text • Updated Oct 1, 2024 • 1.76k • 381
microsoft/Phi-3-vision-128k-instruct • Text Generation • 4B • Updated Dec 10, 2025 • 252k • 970