Reinforcement Learning for Large Language Models: Beyond the Agent Paradigm
Mar 19, 2025