PGrad: Learning Principal Gradients For Domain Generalization Paper • 2305.01134 • Published May 2, 2023 • 1
TurboFuzzLLM: Turbocharging Mutation-based Fuzzing for Effectively Jailbreaking Large Language Models in Practice Paper • 2502.18504 • Published Feb 21, 2025 • 1
A Closer Look at Adversarial Suffix Learning for Jailbreaking LLMs: Augmented Adversarial Trigger Learning Paper • 2503.12339 • Published Mar 16, 2025 • 1
Zero-knowledge LLM hallucination detection and mitigation through fine-grained cross-model consistency Paper • 2508.14314 • Published Aug 19, 2025 • 1
Graph of Attacks with Pruning: Optimizing Stealthy Jailbreak Prompt Generation for Enhanced LLM Content Moderation Paper • 2501.18638 • Published Jan 28, 2025 • 1