DocScope-R1
🫓
168
Cosmos-R1 / docscopeOCR / Captioner-7B / visionOCR-3B
Cosmos-R1 / docscopeOCR / Captioner-7B / visionOCR-3B
Qwen Image LoRA's
Florence-2-large / Florence-2-base
OCR, VQA, Thinking and Object Detection.
High-accuracy vision & reasoning for complex tasks
Experiment with small super OCR models here.
Demo of a collection of Qwen3-VL models
Analyze images and videos with detailed reasoning responses
Generate detailed captions for any image