Achieving Gold-Medal-Level Olympiad Reasoning via Simple and Unified Scaling Paper • 2605.13301 • Published 10 days ago • 154
MAMMAL -- Molecular Aligned Multi-Modal Architecture and Language Paper • 2410.22367 • Published Oct 28, 2024 • 1
KARMA: Leveraging Multi-Agent LLMs for Automated Knowledge Graph Enrichment Paper • 2502.06472 • Published Feb 10, 2025 • 3
AnyFlow: Any-Step Video Diffusion Model with On-Policy Flow Map Distillation Paper • 2605.13724 • Published 10 days ago • 96
AI Co-Mathematician: Accelerating Mathematicians with Agentic AI Paper • 2605.06651 • Published 16 days ago • 15
Tuna-2: Pixel Embeddings Beat Vision Encoders for Multimodal Understanding and Generation Paper • 2604.24763 • Published 26 days ago • 71
Enhancing Spatial Understanding in Image Generation via Reward Modeling Paper • 2602.24233 • Published Feb 27 • 60
ScaleEdit-12M: Scaling Open-Source Image Editing Data Generation via Multi-Agent Framework Paper • 2603.20644 • Published Mar 21 • 5
EditCaption: Human-Aligned Instruction Synthesis for Image Editing via Supervised Fine-Tuning and Direct Preference Optimization Paper • 2604.08213 • Published Apr 9 • 1
ImageEdit-R1: Boosting Multi-Agent Image Editing via Reinforcement Learning Paper • 2603.08059 • Published Mar 9 • 1
OARS: Process-Aware Online Alignment for Generative Real-World Image Super-Resolution Paper • 2603.12811 • Published Mar 13 • 1
EditHF-1M: A Million-Scale Rich Human Preference Feedback for Image Editing Paper • 2603.14916 • Published Mar 16 • 2
Trust Your Critic: Robust Reward Modeling and Reinforcement Learning for Faithful Image Editing and Generation Paper • 2603.12247 • Published Mar 12 • 24
Diff-Aid: Inference-time Adaptive Interaction Denoising for Rectified Text-to-Image Generation Paper • 2602.13585 • Published Feb 27 • 1
Omni-Video 2: Scaling MLLM-Conditioned Diffusion for Unified Video Generation and Editing Paper • 2602.08820 • Published Feb 9 • 1
DeepGen 1.0: A Lightweight Unified Multimodal Model for Advancing Image Generation and Editing Paper • 2602.12205 • Published Feb 12 • 83