LAVIE: High-Quality Video Generation with Cascaded Latent Diffusion Models Paper • 2309.15103 • Published Sep 26, 2023 • 43
Latte: Latent Diffusion Transformer for Video Generation Paper • 2401.03048 • Published Jan 5, 2024 • 2
Cinemo: Consistent and Controllable Image Animation with Motion Diffusion Models Paper • 2407.15642 • Published Jul 22, 2024 • 11
Template Matters: Understanding the Role of Instruction Templates in Multimodal Language Model Evaluation and Training Paper • 2412.08307 • Published Dec 11, 2024
Training-free Stylized Text-to-Image Generation with Fast Inference Paper • 2505.19063 • Published May 25, 2025
EAvatar: Expression-Aware Head Avatar Reconstruction with Generative Geometry Priors Paper • 2508.13537 • Published Aug 19, 2025
HERO: Hierarchical Extrapolation and Refresh for Efficient World Models Paper • 2508.17588 • Published Aug 25, 2025 • 2
SceneDecorator: Towards Scene-Oriented Story Generation with Scene Planning and Scene Consistency Paper • 2510.22994 • Published Oct 27, 2025
A Gray-box Attack against Latent Diffusion Model-based Image Editing by Posterior Collapse Paper • 2408.10901 • Published Aug 20, 2024
OmniShow: Unifying Multimodal Conditions for Human-Object Interaction Video Generation Paper • 2604.11804 • Published 3 days ago • 65