Agent+RL - a leqing Collection

leqing 's Collections

thinking shorter

Agent+RL

updated May 22, 2025

Web-Shepherd: Advancing PRMs for Reinforcing Web Agents

Paper • 2505.15277 • Published May 21, 2025 • 105
Efficient Agent Training for Computer Use

Paper • 2505.13909 • Published May 20, 2025 • 44