UniAudio 2.0: A Unified Audio Language Model with Text-Aligned Factorized Audio Tokenization Paper • 2602.04683 • Published Feb 4 • 3
Motion 3-to-4: 3D Motion Reconstruction for 4D Synthesis Paper • 2601.14253 • Published Jan 20 • 10
FlowAct-R1: Towards Interactive Humanoid Video Generation Paper • 2601.10103 • Published Jan 15 • 77
OmniTransfer: All-in-one Framework for Spatio-temporal Video Transfer Paper • 2601.14250 • Published Jan 20 • 47
VideoMaMa: Mask-Guided Video Matting via Generative Prior Paper • 2601.14255 • Published Jan 20 • 15