Vector Policy Optimization: Training for Diversity Improves Test-Time Search Paper • 2605.22817 • Published May 21 • 1
OmniGameArena: A Unified UE5 Benchmark for VLM Game Agents with Improvement Dynamics Paper • 2606.09826 • Published 18 days ago • 19
Economy of Minds: Emerging Multi-Agent Intelligence with Economic Interactions Paper • 2606.02859 • Published 25 days ago • 8
Qwen-VLA: Unifying Vision-Language-Action Modeling across Tasks, Environments, and Robot Embodiments Paper • 2605.30280 • Published 29 days ago • 146
OSP-Next: Efficient High-Quality Video Generation with Sparse Sequence Parallelism, HiF8 Quantization, and Reinforcement Learning Paper • 2605.28691 • Published 30 days ago • 24
Self-Improving Language Models with Bidirectional Evolutionary Search Paper • 2605.28814 • Published 30 days ago • 61
BES Collection Self-Improving Language Models with Bidirectional Evolutionary Search • 5 items • Updated 29 days ago • 3
SpecEyes: Accelerating Agentic Multimodal LLMs via Speculative Perception and Planning Paper • 2603.23483 • Published Mar 24 • 63
iFSQ: Improving FSQ for Image Generation with 1 Line of Code Paper • 2601.17124 • Published Jan 23 • 34
One-step Latent-free Image Generation with Pixel Mean Flows Paper • 2601.22158 • Published Jan 29 • 18
Good SFT Optimizes for SFT, Better SFT Prepares for Reinforcement Learning Paper • 2602.01058 • Published Feb 1 • 45
metaTextGrad: Automatically optimizing language model optimizers Paper • 2505.18524 • Published Oct 22, 2025 • 2
MENTOR: Mixture-of-Experts Network with Task-Oriented Perturbation for Visual Reinforcement Learning Paper • 2410.14972 • Published Oct 19, 2024 • 1
ACE : Off-Policy Actor-Critic with Causality-Aware Entropy Regularization Paper • 2402.14528 • Published Feb 22, 2024 • 1
Can Pre-Trained Text-to-Image Models Generate Visual Goals for Reinforcement Learning? Paper • 2307.07837 • Published Jul 15, 2023 • 1