From Natural Language to Extensive-Form Game Representations Paper • 2501.17282 • Published Jan 28, 2025 • 1
A Unified Framework for Rethinking Policy Divergence Measures in GRPO Paper • 2602.05494 • Published Feb 5 • 2
QUASAR: Quantum Assembly Code Generation Using Tool-Augmented LLMs via Agentic RL Paper • 2510.00967 • Published Oct 1, 2025 • 12