Article One-Sentence Image Matting! DiffSynth Open Sources Text-Guided Image Layer Separation Model Jan 14 • 3
TimeLens: Rethinking Video Temporal Grounding with Multimodal LLMs Paper • 2512.14698 • Published Dec 16, 2025 • 21
Reward Forcing: Efficient Streaming Video Generation with Rewarded Distribution Matching Distillation Paper • 2512.04678 • Published Dec 4, 2025 • 42
MagicQuillV2: Precise and Interactive Image Editing with Layered Visual Cues Paper • 2512.03046 • Published Dec 2, 2025 • 12
Scaling Instruction-Based Video Editing with a High-Quality Synthetic Dataset Paper • 2510.15742 • Published Oct 17, 2025 • 51
EditVerse: Unifying Image and Video Editing and Generation with In-Context Learning Paper • 2509.20360 • Published Sep 24, 2025 • 18
UMO: Scaling Multi-Identity Consistency for Image Customization via Matching Reward Paper • 2509.06818 • Published Sep 8, 2025 • 29
PosterCraft: Rethinking High-Quality Aesthetic Poster Generation in a Unified Framework Paper • 2506.10741 • Published Jun 12, 2025 • 27
Calligrapher: Freestyle Text Image Customization Paper • 2506.24123 • Published Jun 30, 2025 • 37
ComfyUI-Copilot: An Intelligent Assistant for Automated Workflow Development Paper • 2506.05010 • Published Jun 5, 2025 • 80
OmniSVG: A Unified Scalable Vector Graphics Generation Model Paper • 2504.06263 • Published Apr 8, 2025 • 183
MangaNinja: Line Art Colorization with Precise Reference Following Paper • 2501.08332 • Published Jan 14, 2025 • 62
UniReal: Universal Image Generation and Editing via Learning Real-world Dynamics Paper • 2412.07774 • Published Dec 10, 2024 • 30
PhotoMaker: Customizing Realistic Human Photos via Stacked ID Embedding Paper • 2312.04461 • Published Dec 7, 2023 • 62
LeviTor: 3D Trajectory Oriented Image-to-Video Synthesis Paper • 2412.15214 • Published Dec 19, 2024 • 15
MagicQuill: An Intelligent Interactive Image Editing System Paper • 2411.09703 • Published Nov 14, 2024 • 80