LightZero: A Unified Benchmark for Monte Carlo Tree Search in General Sequential Decision Scenarios Paper • 2310.08348 • Published Oct 12, 2023 • 4
SafeWork-R1: Coevolving Safety and Intelligence under the AI-45$^{\circ}$ Law Paper • 2507.18576 • Published Jul 24, 2025 • 10
ReZero: Boosting MCTS-based Algorithms by Backward-view and Entire-buffer Reanalyze Paper • 2404.16364 • Published Apr 25, 2024 • 1
One Model for All Tasks: Leveraging Efficient World Models in Multi-Task Planning Paper • 2509.07945 • Published Sep 9, 2025 • 1
ReZero: Boosting MCTS-based Algorithms by Backward-view and Entire-buffer Reanalyze Paper • 2404.16364 • Published Apr 25, 2024 • 1
MetaphorStar: Image Metaphor Understanding and Reasoning with End-to-End Visual Reinforcement Learning Paper • 2602.10575 • Published Feb 11 • 4
One Model for All Tasks: Leveraging Efficient World Models in Multi-Task Planning Paper • 2509.07945 • Published Sep 9, 2025 • 1