VQQA: An Agentic Approach for Video Evaluation and Quality Improvement • Paper • 2603.12310 • Published 3 days ago • 3
Spend Less, Reason Better: Budget-Aware Value Tree Search for LLM Agents • Paper • 2603.12634 • Published 3 days ago
From Sparse to Dense: Multi-View GRPO for Flow Models via Augmented Condition Space • Paper • 2603.12648 • Published 3 days ago • 3
One Model, Many Budgets: Elastic Latent Interfaces for Diffusion Transformers • Paper • 2603.12245 • Published 3 days ago • 14
WeEdit: A Dataset, Benchmark and Glyph-Guided Framework for Text-centric Image Editing • Paper • 2603.11593 • Published 4 days ago • 17
Multi-Task Reinforcement Learning for Enhanced Multimodal LLM-as-a-Judge • Paper • 2603.11665 • Published 4 days ago • 3
Understanding by Reconstruction: Reversing the Software Development Process for LLM Pretraining • Paper • 2603.11103 • Published 5 days ago • 7
Just-in-Time: Training-Free Spatial Acceleration for Diffusion Transformers • Paper • 2603.10744 • Published 5 days ago • 6
CLIPO: Contrastive Learning in Policy Optimization Generalizes RLVR • Paper • 2603.10101 • Published 5 days ago • 3
Code-Space Response Oracles: Generating Interpretable Multi-Agent Policies with Large Language Models • Paper • 2603.10098 • Published 6 days ago • 1
V_{0.5}: Generalist Value Model as a Prior for Sparse RL Rollouts • Paper • 2603.10848 • Published 5 days ago • 9
InternVL-U: Democratizing Unified Multimodal Models for Understanding, Reasoning, Generation and Editing • Paper • 2603.09877 • Published 6 days ago • 41
MiniAppBench: Evaluating the Shift from Text to Interactive HTML Responses in LLM-Powered Assistants • Paper • 2603.09652 • Published 6 days ago • 14
OfficeQA Pro: An Enterprise Benchmark for End-to-End Grounded Reasoning • Paper • 2603.08655 • Published 7 days ago • 3