HiMPO: Hindsight-Informed Memory Policy Optimization for Less-Entangled Credit in Long-Horizon Agents • Paper • 2606.16285 • Published 11 days ago