Rethinking Video Generation Model for the Embodied World Paper • 2601.15282 • Published 28 days ago • 43
Focal Guidance: Unlocking Controllability from Semantic-Weak Layers in Video Diffusion Models Paper • 2601.07287 • Published Jan 12 • 5
MHLA: Restoring Expressivity of Linear Attention via Token-Level Multi-Head Paper • 2601.07832 • Published Jan 12 • 52
MAGREF: Masked Guidance for Any-Reference Video Generation Paper • 2505.23742 • Published May 29, 2025 • 11