arxiv:2606.20636

SkillHarness: Harnessing Safe Skills for Computer-Use Agents

Published on Jun 2

· Submitted by

Yurun Chen on Jun 23

Zhejiang University

Upvote

Authors:

Abstract

SkillHarness is a framework that enables computer-use agents to safely learn and execute skills in dynamic environments by incorporating safety constraints and adaptive skill selection mechanisms.

Generated by Qwen/Qwen2.5-Coder-32B-Instruct

Computer-Use Agents (CUAs) are increasingly deployed in dynamic interactive environments, creating a growing need for continual skill learning during interaction. Recent approaches address this challenge by learning reusable skills from successful trajectories. However, these skill learning methods largely assume static and safe environments, overlooking risks from adversarial interactions (e.g., prompt injections) and environmental dynamics (e.g., pop-ups). In dynamic settings, such assumptions can lead to risky skill learning and brittle execution, undermining the reliability of CUAs. This raises the question: how can CUAs learn and use skills safely in dynamic environments? To address this problem, we propose SkillHarness, a framework for safe skill harnessing in dynamic environments. SkillHarness moves beyond static skill abstractions by modeling skill learning and utilization as a safety-constrained interaction process. Specifically, we introduce the skill boundary that leverages multi-source supervision signals to identify safe skills from interaction trajectories, and construct self-improving safety constraints throughout the skill lifecycle. In addition, SkillHarness introduces selective skill reuse, where tasks are guided to decompose according to context and completed through the selective activation of skill subsets. Our experiments demonstrate that SkillHarness significantly reduces the unsafe rate of learned skills by 57.1% and consistently improves execution stability under dynamic environmental changes, outperforming existing baselines.

View arXiv page View PDF GitHub 1 Add to collection

Community

yurun-chen

Paper submitter about 3 hours ago

We introduce SkillHarness, a framework for safer skill learning and reuse in computer-use agents. Existing skill-learning methods usually extract reusable skills from successful trajectories, but in dynamic environments this can encode unsafe behaviors from prompt injections, policy violations, pop-ups, or brittle UI-specific action flows.
SkillHarness treats skills as context-dependent capabilities rather than fixed scripts. It builds explicit skill boundaries from multiple supervision signals, including successful trajectories, failures, and detected risks, then uses a selective reuse mechanism that activates skills only when their safety and applicability conditions are satisfied.
Across ST-WebAgentBench, WASP, OS-Harm, and OpenApps, SkillHarness reduces unsafe learned skills and improves robustness under adversarial and changing environments. We hope this work provides a useful step toward continual skill learning for computer-use agents that is not only effective, but also safer and more reliable in real-world settings.

GitHub: https://github.com/YurunChen/SkillHarness
Arxiv: https://arxiv.org/abs/2606.20636