Harness-1: Reinforcement Learning for Search Agents with State-Externalizing Harnesses Paper • 2606.02373 • Published 26 days ago • 57
RouteProfile: Elucidating the Design Space of LLM Profiles for Routing Paper • 2605.00180 • Published Apr 30 • 30