--- title: SecureReview emoji: ๐ก colorFrom: gray colorTo: indigo sdk: docker app_port: 7860 pinned: true license: mit tags: - openenv - security - code-review - agent - evaluation - rl short_description: The agent review benchmark for the age of AI. ---
| ### I. Dependency & Supply Chain Security Identify typosquatted packages, hallucinated imports that do not exist on PyPI, and pinned versions with active CVEs. Tests the baseline of supply-chain literacy every reviewer should have. `requirements.txt` ยท `package.json` **24 scenarios ยท 120 findings ยท 15 steps** ##### Easy | ### II. Infrastructure-as-Code Misconfiguration Detection Catch CIS-benchmark violations in Terraform and Kubernetes โ public buckets, wildcard IAM, missing encryption, privileged containers, cross-account trust. Tests multi-file cloud security reasoning. Terraform `.tf` ยท Kubernetes YAML **24 scenarios ยท 155 findings ยท 25 steps** ##### Medium | ### III. Database Migration Safety Analysis Reason about SQL migrations against live production context โ table sizes, write throughput, deployment strategy, downstream services. Tests the hardest form of review: **judgment**. Schema ยท migrations ยท app code **28 scenarios ยท 155 findings ยท 35 steps** ##### Hard |