On-Policy Self-Evolution via Failure Trajectories for Agentic Safety Alignment Paper • 2605.11882 • Published 2 days ago • 13
Vision-Language-Action Safety: Threats, Challenges, Evaluations, and Mechanisms Paper • 2604.23775 • Published 18 days ago • 45
Anatomy of a Lie: A Multi-Stage Diagnostic Framework for Tracing Hallucinations in Vision-Language Models Paper • 2603.15557 • Published Mar 16 • 29