PolaroidVL 1.0 Demo
π
243
a mini vision-language ai model
Generate images from text using Latent Diffusion
Remove backgrounds from images
Segment images using texts, points, or everything mode
Segment objects in images using text prompts or scribbles
Classify images using zero-shot classification
Generate depth map from an image
Generate captions for images using multiple models