Space for the preference study for my thesis.
Generate videos from text prompts and optional images
Generate audio from text, video, or audio prompts
A Step Towards Music Generation Foundation Model