Generate video from an image and audio clip
wan 2.2 alibaba
Generate tags for images using Waifu Diffusion models
A Generalist Diffusion Model for Vision Perception