MIT/ast-finetuned-audioset-10-10-0.4593 Audio Classification • 86.6M • Updated Sep 6, 2023 • 911k • 345
unsloth/Qwen3-VL-2B-Instruct-GGUF Image-Text-to-Text • 2B • Updated Oct 31, 2025 • 27.6k • 25
Qwen3-VL Collection Qwen's new multimodal vision models in GGUF, safetensor, and dynamic Unsloth formats. • 56 items • Updated 9 days ago • 27
Qwen3-VL Multimodal Search Engine Cross-modal text-image search powered by Qwen3-VL • Running on Zero • 15
Ministral 3 Collection A collection of edge models, with Base, Instruct and Reasoning variants, in 3 different sizes: 3B, 8B and 14B. All with vision capabilities. • 9 items • Updated Dec 2, 2025 • 159
MiniCPM-o & MiniCPM-V Collection Multimodal models with leading performance. • 29 items • Updated 18 days ago • 73