VLOGGER: Multimodal Diffusion for Embodied Avatar Synthesis Paper • 2403.08764 • Published Mar 13, 2024 • 36 • 6
The Power of Sound (TPoS): Audio Reactive Video Generation with Stable Diffusion Paper • 2309.04509 • Published Sep 8, 2023 • 1 • 1