DeepGen 1.0: A Lightweight Unified Multimodal Model for Advancing Image Generation and Editing Paper • 2602.12205 • Published 29 days ago • 79
UniReason 1.0: A Unified Reasoning Framework for World Knowledge Aligned Image Generation and Editing Paper • 2602.02437 • Published Feb 2 • 77
UniREditBench: A Unified Reasoning-based Image Editing Benchmark Paper • 2511.01295 • Published Nov 3, 2025 • 39
MoIIE: Mixture of Intra- and Inter-Modality Experts for Large Vision Language Models Paper • 2508.09779 • Published Aug 13, 2025
GeometryZero: Improving Geometry Solving for LLM with Group Contrastive Policy Optimization Paper • 2506.07160 • Published Jun 8, 2025 • 3
Activating Distributed Visual Region within LLMs for Efficient and Effective Vision-Language Training and Inference Paper • 2412.12785 • Published Dec 17, 2024
Autoregressive Semantic Visual Reconstruction Helps VLMs Understand Better Paper • 2506.09040 • Published Jun 10, 2025 • 34