InvokeAI/ip_adapter_sd_image_encoder
0.6B • Updated • 7.21k • 12
a tiny vision language model
Generate detailed Stable Diffusion prompts from any image
Generate detailed prompts from any image
Meta Llama3 8b with Llava Multimodal capabilities
Launch an interactive web interface for the tool
Convert GUI screen to structured elements
Generate detailed image prompts for AI art
Generate detailed image descriptions
Ask questions about images and get detailed answers
Generate customized images from text and reference photos
Convert floorplan images to vector graphics