Sa2VA Simple Demo
Dense Grounded Understanding of Images and Videos
Inpaint videos by masking and removing unwanted objects
Generate transcription and summary from uploaded videos
ColorFlow: Retrieval-Augmented Image Sequence Colorization
Create images from text and reference photos