Bottom-up Policy Optimization: Your Language Model Policy Secretly Contains Internal Policies • Paper • 2512.19673 • Published Dec 22, 2025
RedOne: Revealing Domain-specific LLM Post-Training in Social Networking Services • Paper • 2507.10605 • Published Jul 13, 2025