Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
leqing
's Collections
thinking shorter
Agent+RL
Agent+RL
updated
May 22
Upvote
-
Web-Shepherd: Advancing PRMs for Reinforcing Web Agents
Paper
•
2505.15277
•
Published
May 21
•
104
Efficient Agent Training for Computer Use
Paper
•
2505.13909
•
Published
May 20
•
44
Upvote
-
Share collection
View history
Collection guide
Browse collections