Spurious Rewards Paradox: Mechanistically Understanding How RLVR Activates Memorization Shortcuts in LLMs • Paper 2601.11061 • Published 5 days ago