Add objects to images using text prompts
Search audio for relevant chunks
3D Mesh Generation via Compositional Latent Diffusion
Detect objects in images and videos
Detect, segment, and classify objects in images and videos