Idefics3
Generate text based on an image and prompt
Generate text based on an image and prompt
Media understanding
Identify objects in images using text queries
Generate text and segment images using PaliGemma
Annotate and describe images with text prompts
Segment objects in images or videos using text prompts
Analyze images to caption, detect objects, and extract text
Generate detailed image analyses and depth predictions
Generate detailed descriptions from images and questions
Chat about images and get instant answers
Chat with an AI that understands images and text
Generate captions, detections, and segmentations for any image
Ask questions about images and get detailed answers
Chat with images using MiniGPT-4
Chat with Pixtral 12B using Mistral Inference
Interact with a chatbot that understands text and images
State-of-the-art Zero-shot Object Detection
Chat with Llama about images and text
Answer questions about images with AI chat
Generate text responses based on images and chat history
Paligemma2 Detection with Supervision
Generate text responses from images and text input
Visualize image depth, segmentation, and generation
A unified multimodal understanding and generation model.