Vector Policy Optimization: Training for Diversity Improves Test-Time Search Paper • 2605.22817 • Published May 21 • 1
OmniGameArena: A Unified UE5 Benchmark for VLM Game Agents with Improvement Dynamics Paper • 2606.09826 • Published 17 days ago • 19
Economy of Minds: Emerging Multi-Agent Intelligence with Economic Interactions Paper • 2606.02859 • Published 24 days ago • 8
Economy of Minds: Emerging Multi-Agent Intelligence with Economic Interactions Paper • 2606.02859 • Published 24 days ago • 8
Qwen-VLA: Unifying Vision-Language-Action Modeling across Tasks, Environments, and Robot Embodiments Paper • 2605.30280 • Published 28 days ago • 146
Self-Improving Language Models with Bidirectional Evolutionary Search Paper • 2605.28814 • Published 29 days ago • 60
OSP-Next: Efficient High-Quality Video Generation with Sparse Sequence Parallelism, HiF8 Quantization, and Reinforcement Learning Paper • 2605.28691 • Published 29 days ago • 24
BES Collection Self-Improving Language Models with Bidirectional Evolutionary Search • 5 items • Updated 28 days ago • 3
Self-Improving Language Models with Bidirectional Evolutionary Search Paper • 2605.28814 • Published 29 days ago • 60
Self-Improving Language Models with Bidirectional Evolutionary Search Paper • 2605.28814 • Published 29 days ago • 60