CYBERSECEVAL 3: Advancing the Evaluation of Cybersecurity Risks and Capabilities in Large Language Models Paper • 2408.01605 • Published Aug 2, 2024 • 2
CyberSecEval 2: A Wide-Ranging Cybersecurity Evaluation Suite for Large Language Models Paper • 2404.13161 • Published Apr 19, 2024
How Vulnerable Are AI Agents to Indirect Prompt Injections? Insights from a Large-Scale Public Competition Paper • 2603.15714 • Published Mar 16
LlamaFirewall: An open source guardrail system for building secure AI agents Paper • 2505.03574 • Published May 6, 2025