The Consistency Critic: Correcting Inconsistencies in Generated Images via Reference-Guided Attentive Alignment • Paper • 2511.20614 • Published Nov 25, 2025 • 38
TempSamp-R1: Effective Temporal Sampling with Reinforcement Fine-Tuning for Video LLMs • Paper • 2509.18056 • Published Sep 22, 2025 • 27
Emerging Properties in Unified Multimodal Pretraining • Paper • 2505.14683 • Published May 20, 2025 • 133
Running on Zero • Featured • StoryDiffusion • 610 • Generate images from text prompts with optional reference images
StoryDiffusion: Consistent Self-Attention for Long-Range Image and Video Generation • Paper • 2405.01434 • Published May 2, 2024 • 56
Running on Zero • Featured • PhotoMaker • 1.94k • Generate customized realistic human photos from images and text prompts