GrepSeek: Training Search Agents for Direct Corpus Interaction • Paper • 2605.29307 • Published about 1 month ago • 115
Anti-Self-Distillation for Reasoning RL via Pointwise Mutual Information • Paper • 2605.11609 • Published May 12 • 196