ShutterMuse: Capture-Time Photography Guidance with MLLMs Paper • 2606.25763 • Published 3 days ago • 38
LUCID: Learning Unified Control for Image Deflaring and Exposure Mastery in Nighttime Photography Paper • 2606.06901 • Published 22 days ago • 4
FreeStyle: Free Control of Style-Content Dual-Reference Generation from Community LoRA Mining Paper • 2606.20506 • Published 9 days ago • 28
ControlLight: Towards Controllable, Consistent, and Generalizable Low-Light Enhancement Paper • 2605.25569 • Published May 25 • 21
CutClaw: Agentic Hours-Long Video Editing via Music Synchronization Paper • 2603.29664 • Published Mar 31 • 51
GEditBench v2: A Human-Aligned Benchmark for General Image Editing Paper • 2603.28547 • Published Mar 30 • 32
RealRestorer: Towards Generalizable Real-World Image Restoration with Large-Scale Image Editing Models Paper • 2603.25502 • Published Mar 26 • 58
PixelSmile: Toward Fine-Grained Facial Expression Editing Paper • 2603.25728 • Published Mar 26 • 118
OmniLottie: Generating Vector Animations via Parameterized Lottie Tokens Paper • 2603.02138 • Published Mar 2 • 151
Step 3.5 Flash: Open Frontier-Level Intelligence with 11B Active Parameters Paper • 2602.10604 • Published Feb 11 • 201
A Safety Report on GPT-5.2, Gemini 3 Pro, Qwen3-VL, Doubao 1.8, Grok 4.1 Fast, Nano Banana Pro, and Seedream 4.5 Paper • 2601.10527 • Published Jan 15 • 26
PaCoRe: Learning to Scale Test-Time Compute with Parallel Coordinated Reasoning Paper • 2601.05593 • Published Jan 9 • 87
VINO: A Unified Visual Generator with Interleaved OmniModal Context Paper • 2601.02358 • Published Jan 5 • 30
EditThinker: Unlocking Iterative Reasoning for Any Image Editor Paper • 2512.05965 • Published Dec 5, 2025 • 38