view article Article Training LLM Agents to Act Under Adversarial Evidence with Multi-Reward Dual-Control RL 3 days ago • 1
view article Article Training LLM Agents to Act Under Adversarial Evidence with Multi-Reward Dual-Control RL 3 days ago • 1
PhysRVG: Physics-Aware Unified Reinforcement Learning for Video Generative Models Paper • 2601.11087 • Published 11 days ago • 11
view article Article M2.1: Multilingual and Multi-Task Coding with Strong Generalization 22 days ago • 37
ArenaRL: Scaling RL for Open-Ended Agents via Tournament-based Relative Ranking Paper • 2601.06487 • Published 17 days ago • 50
CausalARC: Abstract Reasoning with Causal World Models Paper • 2509.03636 • Published Sep 3, 2025 • 1