Nudging Beyond the Comfort Zone: Efficient Strategy-Guided Exploration for RLVR Paper • 2605.15726 • Published 4 days ago • 29
Heterogeneous Scientific Foundation Model Collaboration Paper • 2604.27351 • Published 19 days ago • 215
Memory Transfer Learning: How Memories are Transferred Across Domains in Coding Agents Paper • 2604.14004 • Published Apr 15 • 30