vision models - a segmond Collection

segmond 's Collections

pending_space_downloads

Segmond Interests

pending_downloads

training examples

embedding models

vision models

updated Oct 1, 2025

bartowski/UI-TARS-7B-DPO-GGUF

Image-Text-to-Text • 8B • Updated Jan 23, 2025 • 18.9k • 10
bartowski/UI-TARS-72B-SFT-GGUF

Image-Text-to-Text • 73B • Updated Jan 24, 2025 • 1.47k • 1
bartowski/UI-TARS-7B-SFT-GGUF

Image-Text-to-Text • 8B • Updated Jan 24, 2025 • 1.28k • 3
bartowski/UI-TARS-72B-DPO-GGUF

Image-Text-to-Text • 73B • Updated Jan 23, 2025 • 39.7k • 3
bartowski/allenai_olmOCR-7B-0225-preview-GGUF

Image-Text-to-Text • 8B • Updated Feb 25, 2025 • 718 • 7
microsoft/Phi-4-multimodal-instruct

Automatic Speech Recognition • 6B • Updated Dec 10, 2025 • 509k • 1.61k
ggml-org/ultravox-v0_5-llama-3_2-1b-GGUF

Audio-Text-to-Text • 1B • Updated May 25, 2025 • 3.17k • 7
mradermacher/Qwen2-Audio-7B-Instruct-GGUF

Audio-Text-to-Text • 8B • Updated Jul 31, 2025 • 548
city96/FLUX.1-dev-gguf

Text-to-Image • 12B • Updated Aug 18, 2024 • 125k • 1.37k
openbmb/MiniCPM-V-4_5

Image-Text-to-Text • 9B • Updated Mar 10 • 86.7k • 1.09k
Qwen/Qwen-Image-Edit

Image-to-Image • Updated Aug 25, 2025 • 70.9k • • 2.44k