Co-Evolution of Policy and Internal Reward for Language Agents • Paper 2604.03098 • Published Apr 3