AIDE: Task-Specific Fine Tuning with Attribute Guided Multi-Hop Data Expansion Paper • 2412.06136 • Published Dec 9, 2024
Graph of Attacks with Pruning: Optimizing Stealthy Jailbreak Prompt Generation for Enhanced LLM Content Moderation Paper • 2501.18638 • Published Jan 28, 2025 • 1