A Matter of TASTE: Improving Coverage and Difficulty of Agent Benchmarks • Paper • 2605.28556 • Published 7 days ago • 55
On the Scaling of PEFT: Towards Million Personal Models of Trillion Parameters • Paper • 2606.02437 • Published 2 days ago • 88
Crafter: A Multi-Agent Harness for Editable Scientific Figure Generation from Diverse Inputs • Paper • 2605.30611 • Published 6 days ago • 112
When Should Models Change Their Minds? Contextual Belief Management in Large Language Models • Paper • 2605.30219 • Published 6 days ago • 22
LoMo: Local Modality Substitution for Deeper Vision-Language Fusion • Paper • 2605.30265 • Published 6 days ago • 21
AgentDoG 1.5: A Lightweight and Scalable Alignment Framework for AI Agent Safety and Security • Paper • 2605.29801 • Published 6 days ago • 137
Qwen-VLA: Unifying Vision-Language-Action Modeling across Tasks, Environments, and Robot Embodiments • Paper • 2605.30280 • Published 6 days ago • 129
From Pixels to Words -- Towards Native One-Vision Models at Scale • Paper • 2605.28820 • Published 7 days ago • 70
Gamma-World: Generative Multi-Agent World Modeling Beyond Two Players • Paper • 2605.28816 • Published 7 days ago • 417
SpatialBench: Is Your Spatial Foundation Model an All-Round Player? • Paper • 2605.27367 • Published 8 days ago • 70
LocateAnything: Fast and High-Quality Vision-Language Grounding with Parallel Box Decoding • Paper • 2605.27365 • Published 8 days ago • 134
Macaron-A2UI: A Model for Generative UI in Personal Agents • Paper • 2605.24830 • Published 10 days ago • 80
DVAO: Dynamic Variance-adaptive Advantage Optimization for Multi-reward Reinforcement Learning • Paper • 2605.25604 • Published 9 days ago • 133
Rethinking Cross-Layer Information Routing in Diffusion Transformers • Paper • 2605.20708 • Published 14 days ago • 109
Lens: Rethinking Training Efficiency for Foundational Text-to-Image Models • Paper • 2605.21573 • Published 14 days ago • 107
SkillOpt: Executive Strategy for Self-Evolving Agent Skills • Paper • 2605.23904 • Published 12 days ago • 216
PhysX-Omni: Unified Simulation-Ready Physical 3D Generation for Rigid, Deformable, and Articulated Objects • Paper • 2605.21572 • Published 14 days ago • 52
Full Attention Strikes Back: Transferring Full Attention into Sparse within Hundred Training Steps • Paper • 2605.16928 • Published 18 days ago • 93
ACC: Compiling Agent Trajectories for Long-Context Training • Paper • 2605.21850 • Published 13 days ago • 58