Grouped Query Experts: Mixture-of-Experts on GQA Self-Attention Paper • 2606.20945 • Published 8 days ago • 62
EnterpriseClawBench: Benchmarking Agents from Real Workplace Sessions Paper • 2606.23654 • Published 4 days ago • 75
GateMem: Benchmarking Memory Governance in Multi-Principal Shared-Memory Agents Paper • 2606.18829 • Published 9 days ago • 17
💻 Qwopus-Coder Collection Reasoning-distilled coding models optimized for specialized domains like agentic workflows. • 7 items • Updated 11 days ago • 25
Jackrong/Qwopus3.6-27B-Coder-Compat-MTP-GGUF Image-Text-to-Text • 0.5B • Updated 5 days ago • 19.4k • 84
HackSynth: LLM Agent and Evaluation Framework for Autonomous Penetration Testing Paper • 2412.01778 • Published Dec 2, 2024 • 3
Environment-Grounded Multi-Agent Workflow for Autonomous Penetration Testing Paper • 2603.24221 • Published Mar 25 • 1
Getting pwn'd by AI: Penetration Testing with Large Language Models Paper • 2308.00121 • Published Jul 24, 2023 • 1
Knowledge-Informed Auto-Penetration Testing Based on Reinforcement Learning with Reward Machine Paper • 2405.15908 • Published May 24, 2024 • 1
Pentest-R1: Towards Autonomous Penetration Testing Reasoning Optimized via Two-Stage Reinforcement Learning Paper • 2508.07382 • Published Aug 10, 2025 • 2
PentestGPT: An LLM-empowered Automatic Penetration Testing Tool Paper • 2308.06782 • Published Aug 13, 2023 • 1
CIPHER: Cybersecurity Intelligent Penetration-testing Helper for Ethical Researcher Paper • 2408.11650 • Published Aug 21, 2024 • 3
Large Language Models in Cybersecurity: State-of-the-Art Paper • 2402.00891 • Published Jan 30, 2024 • 4
Purple Llama CyberSecEval: A Secure Coding Benchmark for Language Models Paper • 2312.04724 • Published Dec 7, 2023 • 22
CyberPal.AI: Empowering LLMs with Expert-Driven Cybersecurity Instructions Paper • 2408.09304 • Published Aug 17, 2024 • 1