SceneAligner: 3D-Grounded Floorplan Localization in the Wild Paper • 2605.22581 • Published 7 days ago • 6
Multi-Objective and Mixed-Reward Reinforcement Learning via Reward-Decorrelated Policy Optimization Paper • 2605.13641 • Published 15 days ago • 49
AEM: Adaptive Entropy Modulation for Multi-Turn Agentic Reinforcement Learning Paper • 2605.00425 • Published 20 days ago • 23
MNAFT: modality neuron-aware fine-tuning of multimodal large language models for image translation Paper • 2604.16943 • Published Apr 18 • 2
Locally Confident, Globally Stuck: The Quality-Exploration Dilemma in Diffusion Language Models Paper • 2604.00375 • Published Apr 1 • 5
GrandCode: Achieving Grandmaster Level in Competitive Programming via Agentic Reinforcement Learning Paper • 2604.02721 • Published Apr 3 • 630
HY-Embodied-0.5: Embodied Foundation Models for Real-World Agents Paper • 2604.07430 • Published Apr 8 • 189
MegaStyle: Constructing Diverse and Scalable Style Dataset via Consistent Text-to-Image Style Mapping Paper • 2604.08364 • Published Apr 9 • 101
Adam's Law: Textual Frequency Law on Large Language Models Paper • 2604.02176 • Published Apr 2 • 504
QiMeng-PRepair: Precise Code Repair via Edit-Aware Reward Optimization Paper • 2604.05963 • Published Apr 7 • 8
CARLA-Air: Fly Drones Inside a CARLA World -- A Unified Infrastructure for Air-Ground Embodied Intelligence Paper • 2603.28032 • Published Mar 30 • 342
PRBench: End-to-end Paper Reproduction in Physics Research Paper • 2603.27646 • Published Mar 29 • 29
Marco DeepResearch: Unlocking Efficient Deep Research Agents via Verification-Centric Design Paper • 2603.28376 • Published Mar 30 • 24
Bootstrapping Exploration with Group-Level Natural Language Feedback in Reinforcement Learning Paper • 2603.04597 • Published Mar 4 • 211