Harness-1: Reinforcement Learning for Search Agents with State-Externalizing Harnesses Paper • 2606.02373 • Published 3 days ago • 35
s3: You Don't Need That Much Data to Train a Search Agent via RL Paper • 2505.14146 • Published May 20, 2025 • 20