On Time, Within Budget: Constraint-Driven Online Resource Allocation for Agentic Workflows • Paper • 2605.06110 • Published 2 days ago • 16
Do Not Waste Your Rollouts: Recycling Search Experience for Efficient Test-Time Scaling • Paper • 2601.21684 • Published Jan 29 • 10
Training Data Efficiency in Multimodal Process Reward Models • Paper • 2602.04145 • Published Feb 4 • 79