CKeibel's Collections
Multimodal
Updated Apr 15, 2025
HuggingFaceM4/idefics-80b-instruct • Text Generation • 80B • Updated Oct 12, 2023 • 3.1k • 189
liuhaotian/llava-v1.5-13b • Image-Text-to-Text • Updated May 9, 2024 • 20k • 523
llava-hf/llava-v1.6-34b-hf • Image-Text-to-Text • 35B • Updated Jan 27, 2025 • 2.77k • 93
HuggingFaceM4/idefics2-8b • Image-Text-to-Text • 8B • Updated Oct 14, 2024 • 11.9k • 620
microsoft/Phi-3-vision-128k-instruct • Text Generation • 4B • Updated Dec 10, 2025 • 59k • 970
google/paligemma-3b-pt-224 • Image-Text-to-Text • 3B • Updated Sep 21, 2024 • 23.8k • 402
jinaai/jina-clip-v1 • Feature Extraction • 0.2B • Updated May 20, 2025 • 103k • 256
Qwen/Qwen2-VL-2B-Instruct • Image-Text-to-Text • 2B • Updated Jan 12, 2025 • 1.7M • 483
llamaindex/vdr-2b-multi-v1 • Image-to-Text • 2B • Updated May 21, 2025 • 1.41k • 124