TriPlay-RL: Tri-Role Self-Play Reinforcement Learning for LLM Safety Alignment • Paper • 2601.18292 • Published 6 days ago • 10
FABLE: Forest-Based Adaptive Bi-Path LLM-Enhanced Retrieval for Multi-Document Reasoning • Paper • 2601.18116 • Published 7 days ago • 11
Efficient Switchable Safety Control in LLMs via Magic-Token-Guided Co-Training • Paper • 2508.14904 • Published Aug 12, 2025 • 2
Evaluation is All You Need: Strategic Overclaiming of LLM Reasoning Capabilities Through Evaluation Design • Paper • 2506.04734 • Published Jun 5, 2025 • 21
TinyR1-32B-Preview: Boosting Accuracy with Branch-Merge Distillation • Paper • 2503.04872 • Published Mar 6, 2025 • 15