LongVie 2: Multimodal Controllable Ultra-Long Video World Model Paper โข 2512.13604 โข Published Dec 15, 2025 โข 74
Wan-Move: Motion-controllable Video Generation via Latent Trajectory Guidance Paper โข 2512.08765 โข Published Dec 9, 2025 โข 133
PICABench: How Far Are We from Physically Realistic Image Editing? Paper โข 2510.17681 โข Published Oct 20, 2025 โข 64
Routing Matters in MoE: Scaling Diffusion Transformers with Explicit Routing Guidance Paper โข 2510.24711 โข Published Oct 28, 2025 โข 20
QeRL: Beyond Efficiency -- Quantization-enhanced Reinforcement Learning for LLMs Paper โข 2510.11696 โข Published Oct 13, 2025 โข 181
Training-Free Efficient Video Generation via Dynamic Token Carving Paper โข 2505.16864 โข Published May 22, 2025 โข 24
STEVE: AStep Verification Pipeline for Computer-use Agent Training Paper โข 2503.12532 โข Published Mar 16, 2025 โข 17
TrajectoryCrafter: Redirecting Camera Trajectory for Monocular Videos via Diffusion Models Paper โข 2503.05638 โข Published Mar 7, 2025 โข 20
RIFLEx: A Free Lunch for Length Extrapolation in Video Diffusion Transformers Paper โข 2502.15894 โข Published Feb 21, 2025 โข 20
MotionCanvas: Cinematic Shot Design with Controllable Image-to-Video Generation Paper โข 2502.04299 โข Published Feb 6, 2025 โข 18
Lyra: An Efficient and Speech-Centric Framework for Omni-Cognition Paper โข 2412.09501 โข Published Dec 12, 2024 โข 48
ControlNeXt: Powerful and Efficient Control for Image and Video Generation Paper โข 2408.06070 โข Published Aug 12, 2024 โข 55
Real-World Image Variation by Aligning Diffusion Inversion Chain Paper โข 2305.18729 โข Published May 30, 2023 โข 5
Make-Your-Video: Customized Video Generation Using Textual and Structural Guidance Paper โข 2306.00943 โข Published Jun 1, 2023 โข 6
DynamiCrafter: Animating Open-domain Images with Video Diffusion Priors Paper โข 2310.12190 โข Published Oct 18, 2023 โข 13