HumanNet: Scaling Human-centric Video Learning to One Million Hours Paper • 2605.06747 • Published 15 days ago • 51
Efficient Training on Multiple Consumer GPUs with RoundPipe Paper • 2604.27085 • Published 23 days ago • 40
Efficient Training on Multiple Consumer GPUs with RoundPipe Paper • 2604.27085 • Published 23 days ago • 40
Enhancing Spatial Understanding in Image Generation via Reward Modeling Paper • 2602.24233 • Published Feb 27 • 60
Focal Guidance: Unlocking Controllability from Semantic-Weak Layers in Video Diffusion Models Paper • 2601.07287 • Published Jan 12 • 6
MHLA: Restoring Expressivity of Linear Attention via Token-Level Multi-Head Paper • 2601.07832 • Published Jan 12 • 53
MHLA: Restoring Expressivity of Linear Attention via Token-Level Multi-Head Paper • 2601.07832 • Published Jan 12 • 53
MAGREF: Masked Guidance for Any-Reference Video Generation Paper • 2505.23742 • Published May 29, 2025 • 11