HERMES: KV Cache as Hierarchical Memory for Efficient Streaming Video Understanding Paper • 2601.14724 • Published 5 days ago • 71
Transition Matching Distillation for Fast Video Generation Paper • 2601.09881 • Published 11 days ago • 31
Alterbute: Editing Intrinsic Attributes of Objects in Images Paper • 2601.10714 • Published 11 days ago • 29
SnapGen++: Unleashing Diffusion Transformers for Efficient High-Fidelity Image Generation on Edge Devices Paper • 2601.08303 • Published 13 days ago • 16
MHLA: Restoring Expressivity of Linear Attention via Token-Level Multi-Head Paper • 2601.07832 • Published 14 days ago • 51
GenCtrl -- A Formal Controllability Toolkit for Generative Models Paper • 2601.05637 • Published 17 days ago • 4
Klear: Unified Multi-Task Audio-Video Joint Generation Paper • 2601.04151 • Published 19 days ago • 15
DreamStyle: A Unified Framework for Video Stylization Paper • 2601.02785 • Published 20 days ago • 24
LTX-2: Efficient Joint Audio-Visual Foundation Model Paper • 2601.03233 • Published 20 days ago • 135
LiveTalk: Real-Time Multimodal Interactive Video Diffusion via Improved On-Policy Distillation Paper • 2512.23576 • Published 28 days ago • 65
Avatar Forcing: Real-Time Interactive Head Avatar Generation for Natural Conversation Paper • 2601.00664 • Published 24 days ago • 55
NeoVerse: Enhancing 4D World Model with in-the-wild Monocular Videos Paper • 2601.00393 • Published 25 days ago • 128
Stream-DiffVSR: Low-Latency Streamable Video Super-Resolution via Auto-Regressive Diffusion Paper • 2512.23709 • Published 28 days ago • 49
Bridging Your Imagination with Audio-Video Generation via a Unified Director Paper • 2512.23222 • Published 28 days ago • 6
VA-π: Variational Policy Alignment for Pixel-Aware Autoregressive Generation Paper • 2512.19680 • Published Dec 22, 2025 • 11