AnyRecon: Arbitrary-View 3D Reconstruction with Video Diffusion Model Paper • 2604.19747 • Published 21 days ago • 39
zhiyuanyou/Qwen2.5-VL-7B-GRPO-Composition-Score-Class Image-Text-to-Text • 8B • Updated 25 days ago • 54
Diversity-Preserved Distribution Matching Distillation for Fast Visual Synthesis Paper • 2602.03139 • Published Feb 3 • 44
RouteMoA: Dynamic Routing without Pre-Inference Boosts Efficient Mixture-of-Agents Paper • 2601.18130 • Published Jan 26 • 2
UniPercept: Towards Unified Perceptual-Level Image Understanding across Aesthetics, Quality, Structure, and Texture Paper • 2512.21675 • Published Dec 25, 2025 • 28
ScreenCoder: Advancing Visual-to-Code Generation for Front-End Automation via Modular Multimodal Agents Paper • 2507.22827 • Published Jul 30, 2025 • 101
VisualQuality-R1: Reasoning-Induced Image Quality Assessment via Reinforcement Learning to Rank Paper • 2505.14460 • Published May 20, 2025 • 33