Scaling Reinforcement Learning for Content Moderation with Large Language Models • Paper • 2512.20061 • Published Dec 23, 2025