moondream2
a tiny vision language model
a tiny vision language model
MoonDream 2 Vision Model on the Browser: Candle/Rust/WASM
Generate text from images and prompts
Generate images from text prompts
Meta Llama3 8b with Llava Multimodal capabilities
Generate text and segment images using PaliGemma
Ask questions about images
Chat with an image using Phi-3 Vision model
Microsoft Phi-3 Vision 128k with Multimodal capabilities
let's talk about the meaning of life
Convert images to grayscale
Perform image captioning, detection, OCR and more with Florenceβ2
Generate detailed captions for any image
Generate detailed image captions
Analyze images to detect objects, generate captions, or perform OCR
A private and powerful multimodal AI chatbot that runs local
Generate images from prompts or images
Generate text based on an image and prompt
Ask questions about images
Generate answers to questions about any image
GOT - OCR (from : UCAS, Beijing)
Chat with Llama about images and text
Huggingface space for JanusFlow-1.3B
Generate text from images and queries
Generate captions for images