VulnLLM-R: Specialized Reasoning LLM with Agent Scaffold for Vulnerability Detection Paper • 2512.07533 • Published 21 days ago • 2
SecCodePLT: A Unified Platform for Evaluating the Security of Code GenAI Paper • 2410.11096 • Published Oct 14, 2024 • 13
OWL: Optimized Workforce Learning for General Multi-Agent Assistance in Real-World Task Automation Paper • 2505.23885 • Published May 29, 2025
AgentVigil: Generic Black-Box Red-teaming for Indirect Prompt Injection against LLM Agents Paper • 2505.05849 • Published May 9, 2025
DVERGE: Diversifying Vulnerabilities for Enhanced Robust Generation of Ensembles Paper • 2009.14720 • Published Sep 30, 2020
OpenOOD v1.5: Enhanced Benchmark for Out-of-Distribution Detection Paper • 2306.09301 • Published Jun 15, 2023 • 1
Mixture Outlier Exposure: Towards Out-of-Distribution Detection in Fine-grained Environments Paper • 2106.03917 • Published Jun 7, 2021
Min-K%++: Improved Baseline for Detecting Pre-Training Data from Large Language Models Paper • 2404.02936 • Published Apr 3, 2024 • 3