HumanNet: Scaling Human-centric Video Learning to One Million Hours Paper • 2605.06747 • Published 4 days ago • 33
Efficient Training on Multiple Consumer GPUs with RoundPipe Paper • 2604.27085 • Published 12 days ago • 40
Enhancing Spatial Understanding in Image Generation via Reward Modeling Paper • 2602.24233 • Published Feb 27 • 60
Light Forcing: Accelerating Autoregressive Video Diffusion via Sparse Attention Paper • 2602.04789 • Published Feb 4 • 3
MHLA: Restoring Expressivity of Linear Attention via Token-Level Multi-Head Paper • 2601.07832 • Published Jan 12 • 52