DocScope-R1
🖤
168
Cosmos-R1 / docscopeOCR / Captioner-7B / visionOCR-3B
Cosmos-R1 / docscopeOCR / Captioner-7B / visionOCR-3B
Qwen Image LoRA's
Florence-2-large / Florence-2-base
OCR, VQA, Thinking and Object Detection.
High-accuracy vision & reasoning for complex tasks
Experiment with small super OCR models here.
Demo of a collection of Qwen3-VL models
Answer questions about images and videos
Generate custom captions, tags, or prompts for any image
Compare speech recognition models on benchmark scores