The Blind Spot of Agent Safety: How Benign User Instructions Expose Critical Vulnerabilities in Computer-Use Agents • Paper • 2604.10577 • Published 27 days ago • 24
Structured Distillation of Web Agent Capabilities Enables Generalization • Paper • 2604.07776 • Published 30 days ago • 22
CoAct-1: Computer-using Agents with Coding as Actions • Paper • 2508.03923 • Published Aug 5, 2025 • 13
Feedback Friction: LLMs Struggle to Fully Incorporate External Feedback • Paper • 2506.11930 • Published Jun 13, 2025 • 53
The Hallucination Tax of Reinforcement Finetuning • Paper • 2505.13988 • Published May 20, 2025 • 8
Efficient Reinforcement Finetuning via Adaptive Curriculum Learning • Paper • 2504.05520 • Published Apr 7, 2025 • 11
Discovering Knowledge Deficiencies of Language Models on Massive Knowledge Base • Paper • 2503.23361 • Published Mar 30, 2025 • 5
TrustLLM: Trustworthiness in Large Language Models • Paper • 2401.05561 • Published Jan 10, 2024 • 69