HiAR: Efficient Autoregressive Long Video Generation via Hierarchical Denoising Paper • 2603.08703 • Published 4 days ago • 29
Holi-Spatial: Evolving Video Streams into Holistic 3D Spatial Intelligence Paper • 2603.07660 • Published 5 days ago • 77
WildActor: Unconstrained Identity-Preserving Video Generation Paper • 2603.00586 • Published 14 days ago • 32
Geometry-Guided Reinforcement Learning for Multi-view Consistent 3D Scene Editing Paper • 2603.03143 • Published 10 days ago • 136
TDM-R1: Reinforcing Few-Step Diffusion Models with Non-Differentiable Reward Paper • 2603.07700 • Published 5 days ago • 13
EchoTorrent: Towards Swift, Sustained, and Streaming Multi-Modal Video Generation Paper • 2602.13669 • Published 28 days ago • 2
S2DiT: Sandwich Diffusion Transformer for Mobile Streaming Video Generation Paper • 2601.12719 • Published Jan 19 • 1
DDiT: Dynamic Patch Scheduling for Efficient Diffusion Transformers Paper • 2602.16968 • Published 23 days ago • 12
OmniLottie: Generating Vector Animations via Parameterized Lottie Tokens Paper • 2603.02138 • Published 11 days ago • 138
Moonshine: Speech Recognition for Live Transcription and Voice Commands Paper • 2410.15608 • Published Oct 21, 2024 • 11
JavisDiT++: Unified Modeling and Optimization for Joint Audio-Video Generation Paper • 2602.19163 • Published 19 days ago • 14