CaveAgent: Transforming LLMs into Stateful Runtime Operators Paper • 2601.01569 • Published Jan 4 • 20
GoRL: An Algorithm-Agnostic Framework for Online Reinforcement Learning with Generative Policies Paper • 2512.02581 • Published Dec 2, 2025 • 15