Spencer Whitman

SpencerSwan

AI & ML interests

AI security & safety

Recent Activity

authored a paper 10 days ago

The Llama 3 Herd of Models

authored a paper 10 days ago

CYBERSECEVAL 3: Advancing the Evaluation of Cybersecurity Risks and Capabilities in Large Language Models

authored a paper 10 days ago

CyberSecEval 2: A Wide-Ranging Cybersecurity Evaluation Suite for Large Language Models

authored 5 papers 10 days ago

The Llama 3 Herd of Models

Paper • 2407.21783 • Published Jul 31, 2024 • 118

CYBERSECEVAL 3: Advancing the Evaluation of Cybersecurity Risks and Capabilities in Large Language Models

Paper • 2408.01605 • Published Aug 2, 2024 • 2

CyberSecEval 2: A Wide-Ranging Cybersecurity Evaluation Suite for Large Language Models

Paper • 2404.13161 • Published Apr 19, 2024

How Vulnerable Are AI Agents to Indirect Prompt Injections? Insights from a Large-Scale Public Competition

Paper • 2603.15714 • Published Mar 16, 2026

LlamaFirewall: An open source guardrail system for building secure AI agents

Paper • 2505.03574 • Published May 6, 2025