Images to Text - a ijohn07 Collection

ijohn07 's Collections

Text to images NSFW

Justines's Llamafiles

Images to Text

updated Mar 2

Running

Agents

443

moondream2

🌔

443

a tiny vision language model
Running

Featured

37

Candle Moondream 2

🕯

37

MoonDream 2 Vision Model on the Browser: Candle/Rust/WASM
Paused

Agents

Featured

146

Idefics 8b

🐠

146

Generate text from images and prompts
Running on Zero

Agents

89

Llava Llama-3 8B

🔥

89

Meta Llama3 8b with Llava Multimodal capabilities
Runtime error

Agents

85

Paligemma HF

🤗

85

Generate text and segment images using PaliGemma
Running on Zero

Agents

Featured

151

Llava Next

🔥

151

Chat with an AI about any uploaded image
Running on Zero

Agents

Featured

219

Microsoft Phi-3-Vision-128k

😻

219

Chat with an image using Phi-3 Vision model
Sleeping

Agents

47

Microsoft Phi-3 Vision 128k

🔥

47

Microsoft Phi-3 Vision 128k with Multimodal capabilities
Running on Zero

Agents

Featured

51

Contemplative moondream

🌜

51

let's talk about the meaning of life
Running

3

Gradio Lite

🖼

3

Convert images to grayscale
Running on Zero

Agents

Featured

845

Florence 2

📉

845

Generate captions, detections, and segmentations from images
Running on Zero

Agents

Featured

259

SD3 Long Captioner

🏃

259

Generate detailed captions for your images
Running

Agents

38

Florence 2 SD3 Captioner

⚡

38

Generate detailed captions for any image
Running on Zero

Agents

Featured

197

Better Florence 2

🔥

197

Analyze images to detect objects, generate captions, or perform OCR
Running

23

LLaVA WebGPU

🌋

23

A private and powerful multimodal AI chatbot that runs local
Running on Zero

Agents

90

AuraFlow-v0.3 with Captioner

🖼

90

Generate images from captions or enhanced prompts
Paused

Agents

Featured

102

Idefics3

📊

102

Generate text based on an image and prompt
Running on Zero

Agents

31

Phi 3.5 Vision

👁

31

Ask questions about images
Running on Zero

Agents

Featured

227

Phi 3.5 Vision

🔥

227

Ask questions about images and get detailed answers
Build error

MCP

Featured

178

Tonic's GOT OCR

📲

178

GOT - OCR (from : UCAS, Beijing)
Runtime error

Agents

Featured

390

Llama-Vision-11B

🚀

390

Chat with Llama about images and text
Running on Zero

Agents

Featured

220

JanusFlow 1.3B

🏃

220

Huggingface space for JanusFlow-1.3B
Running on Zero

Agents

143

SmolVLM

📊

143

Answer questions about images with AI chat
Sleeping

Agents

1

SD3 Long Captioner

🏃

1

Generate captions for images