SmolVLM 256M Instruct WebGPU
🐨
72
Generate captions for your images instantly
Generate captions for your images instantly
Remove background from images and videos
Generate text answers or segment objects from images
Generate answers by combining text and images
Compare SigLIP1 and SigLIP2 on zero shot classification
Detect and segment objects in images using text, visual, or prompt-free prompts
Generate depth map from your photo
Upscale photos to any size with neural super‑resolution
Precise Background Preservation in Editing
Edit an image based on the given instruction.
Extreme Super-Resolution via Scale Autoregression
Generate images from text descriptions
THUDM/GLM-4.1V-9B-Thinking Demo
Generate custom images with style and subject references
Explore object detection, visual grounding, keypoint Detecti