Generate video from an image and audio
Wan 2.2 by Alibaba
Generate tags for images using Waifu Diffusion models
A Generalist Diffusion Model for Vision Perception