ScaleEdit-12M: Scaling Open-Source Image Editing Data Generation via Multi-Agent Framework Paper • 2603.20644 • Published Mar 21 • 5 • 1
EditCaption: Human-Aligned Instruction Synthesis for Image Editing via Supervised Fine-Tuning and Direct Preference Optimization Paper • 2604.08213 • Published Apr 9 • 1 • 1
ImageEdit-R1: Boosting Multi-Agent Image Editing via Reinforcement Learning Paper • 2603.08059 • Published Mar 9 • 1 • 1
OARS: Process-Aware Online Alignment for Generative Real-World Image Super-Resolution Paper • 2603.12811 • Published Mar 13 • 1 • 1
Diff-Aid: Inference-time Adaptive Interaction Denoising for Rectified Text-to-Image Generation Paper • 2602.13585 • Published Feb 27 • 1 • 1
Coevolving Representations in Joint Image-Feature Diffusion Paper • 2604.17492 • Published 25 days ago • 5 • 3
Trace2Skill: Distill Trajectory-Local Lessons into Transferable Agent Skills Paper • 2603.25158 • Published Mar 26 • 52 • 14
LUVE: Latent-Cascaded Ultra-High-Resolution Video Generation with Dual Frequency Experts Paper • 2602.11564 • Published Feb 12 • 1 • 1
Train Short, Inference Long: Training-free Horizon Extension for Autoregressive Video Generation Paper • 2602.14027 • Published Feb 15 • 1 • 1
VTok: A Unified Video Tokenizer with Decoupled Spatial-Temporal Latents Paper • 2602.04202 • Published Feb 4 • 1 • 1