EMO: Emote Portrait Alive - Generating Expressive Portrait Videos with Audio2Video Diffusion Model under Weak Conditions • Paper • 2402.17485 • Published Feb 27, 2024 • 194
ChatMusician: Understanding and Generating Music Intrinsically with LLM • Paper • 2402.16153 • Published Feb 25, 2024 • 57
Macaw-LLM: Multi-Modal Language Modeling with Image, Audio, Video, and Text Integration • Paper • 2306.09093 • Published Jun 15, 2023 • 16