HumanNet: Scaling Human-centric Video Learning to One Million Hours Paper • 2605.06747 • Published 5 days ago • 40
Efficient Training on Multiple Consumer GPUs with RoundPipe Paper • 2604.27085 • Published 13 days ago • 40
Enhancing Spatial Understanding in Image Generation via Reward Modeling Paper • 2602.24233 • Published Feb 27 • 60
Light Forcing: Accelerating Autoregressive Video Diffusion via Sparse Attention Paper • 2602.04789 • Published Feb 4 • 3
MHLA: Restoring Expressivity of Linear Attention via Token-Level Multi-Head Paper • 2601.07832 • Published Jan 12 • 52