WebArbiter: A Principle-Guided Reasoning Process Reward Model for Web Agents
Paper
• 2601.21872 • Published
None defined yet.
WebArbiter: A Principle-Guided Reasoning Process Reward Model for Web Agents
GroundedPRM: Tree-Guided and Fidelity-Aware Process Reward Modeling for Step-Level Reasoning