Thinking with Video: Video Generation as a Promising Multimodal Reasoning Paradigm • Paper • 2511.04570 • Published Nov 6 • 210
LatentSync: Audio Conditioned Latent Diffusion Models for Lip Sync • Paper • 2412.09262 • Published Dec 12, 2024 • 1