LongVideoAgent: Multi-Agent Reasoning with Long Videos Paper âĒ 2512.20618 âĒ Published Dec 23, 2025 âĒ 54
Pico-Banana-400K: A Large-Scale Dataset for Text-Guided Image Editing Paper âĒ 2510.19808 âĒ Published Oct 22, 2025 âĒ 30
Wan-Move: Motion-controllable Video Generation via Latent Trajectory Guidance Paper âĒ 2512.08765 âĒ Published Dec 9, 2025 âĒ 132
MultiShotMaster: A Controllable Multi-Shot Video Generation Framework Paper âĒ 2512.03041 âĒ Published Dec 2, 2025 âĒ 63
Can World Simulators Reason? Gen-ViRe: A Generative Visual Reasoning Benchmark Paper âĒ 2511.13853 âĒ Published Nov 17, 2025 âĒ 35
MultiBooth: Towards Generating All Your Concepts in an Image from Text Paper âĒ 2404.14239 âĒ Published Apr 22, 2024 âĒ 9