Spaces:

Chris4K
/

autoscan

Running

App Files Files Community

autoscan / docs /roadmap.md

Chris4K

Upload 384 files

a2a5bfd verified 1 day ago

preview code

raw

history blame contribute delete

7.99 kB

	# Roadmap

	This document tracks planned features, improvements, and known technical debt for autoscan / SENTINEL.

	Items are grouped by priority tier. The "Done" section is a record of completed milestones.

	---

	## Done ✅ (v5.0)

	\| Feature \| Notes \|
	\|---------\|-------\|
	\| FastAPI web application (Sentinel) \| Replaces Gradio-only UI; multi-user ready \|
	\| HuggingFace Space discovery \| Search, filter by stage / hardware / framework / MCP \|
	\| Parallel scan execution \| `ThreadPoolExecutor`, SSE live progress stream \|
	\| Per-tool scanner selection \| Individual tools selectable in Discover UI and API \|
	\| html2pdf.js export \| Client-side PDF, no server dependency \|
	\| Notifications panel \| Bell icon, mark-read, delete \|
	\| Bootstrap binaries \| Auto-download gitleaks + hadolint on startup \|
	\| Share links \| Time-limited read-only scan URLs \|
	\| Insights page \| Severity breakdown, 14-day trend, top targets \|
	\| Knowledge Base \| Searchable remediation articles \|
	\| Schedules (APScheduler) \| Cron-based automated scans \|
	\| AI Explainer \| Ollama / OpenAI per-finding annotations \|
	\| CVE data externalization (T15) \| `cve_data.json` + `cve_data_schema.py`; runner loads from JSON; backward-compat `CVE_TRIGGERS` dict preserved \|
	\| CVE feed refresh job (T16+21) \| OSV.dev + GitHub Advisories fetch for 26 packages; weekly APScheduler job (Mon 06:00); startup stale-check; `POST /api/cve-feed/refresh`; Notification row on new CVEs \|
	\| Confidence scoring layer (T17) \| `core/scoring.py` — 0–10 risk score per finding; wired into `scan_repo()`; `score` + `h1_draft` DB columns; Alembic migration `c3d4e5f6a7b8` \|
	\| H1 auto-draft (T18) \| `sentinel/services/h1_draft.py`; LLM generates HackerOne-style report for score≥7 findings; collapsible panel in findings table \|
	\| Score badge UI \| Color-coded 0–10 risk badge in findings table (red≥9, orange≥7, yellow≥4, gray<4) \|
	\| AI explainer prompt docs (T19/T20) \| `docs/weekly_update_prompt.md`, `docs/quarterly_research_prompt.md` \|
	\| Self-scan fixes (self-improvement) \| Fixed `openai-no-max-tokens` LLM10 bug; simplified redundant except tuples; added 6× `# noqa: BLE001` FP annotations; extended `.hfscanignore` + `.agent-audit.yaml` \|
	\| Alembic migrations \| Schema versioning, Sprint 6 indexes + ShareLinks \|
	\| Test suite \| 422 tests; `test_cve_data_schema.py` (16), `test_cve_feed.py` (10 async), `test_scoring.py` (16), `test_h1_draft.py` (12) \|
	\| SARIF 2.1.0 output \| GitHub code-scanning compatible \|
	\| `.hfscanignore` suppression \| Path / rule / severity filters \|
	\| Baseline workflow \| Fingerprint-based new-findings-only mode \|

	---

	## Near-term (v5.1) 🔜

	### Authentication & multi-user

	- [ ] Session-based login with password hashing (bcrypt)
	- [ ] Per-user targets, scans, and notifications (currently hardcoded `user_id=1`)
	- [ ] Role-based access: admin, analyst, read-only
	- [ ] API tokens for CI/CD integrations

	### CI/CD integration improvements

	- [ ] Webhook trigger: POST to `/api/scan/webhook` to start a scan from GitHub Actions
	- [ ] Status badge endpoint (already exists at `/badge/{target_id}`) — document in README
	- [ ] PR comment integration: post findings summary to GitHub/GitLab PR via API

	### Scanner coverage

	- [ ] Trivy — container image and IaC scanning (Dockerfile + SBOM)
	- [ ] OSV-Scanner — open-source vulnerability database (alternative to pip-audit)
	- [ ] Checkov — Terraform / K8s / Dockerfile policy checks
	- [ ] truffleHog — deep git history secret scan (alternative to gitleaks)

	### Reporting

	- [ ] CSV and XLSX export of findings
	- [ ] SBOM (Software Bill of Materials) generation (CycloneDX / SPDX)
	- [ ] Finding diff between two scans (regression view)
	- [ ] Email report on scan completion (SMTP already wired, needs template)

	---

	## Medium-term (v5.2) 📅

	### Performance & scalability

	- [ ] Replace in-process `ThreadPoolExecutor` with a proper task queue (Celery + Redis or ARQ)
	- [ ] PostgreSQL support (already parameterised via `DATABASE_URL`, needs integration test)
	- [ ] Horizontal scaling: multiple Uvicorn workers with shared task queue
	- [ ] Caching layer for HuggingFace API responses (reduce rate-limit hits)

	### UI improvements

	- [ ] Dark mode persistence (Alpine.js localStorage — partial)
	- [ ] Bulk triage: apply status change to all selected findings
	- [ ] Findings diff view: compare two scans side-by-side
	- [ ] Target groups / tags for organising many monitored spaces
	- [ ] Paginated findings table (currently loads all findings in one query)
	- [ ] Keyboard shortcuts (e.g. `n`/`p` for next/prev finding, `x` to triage)

	### AI Explainer

	- [ ] Anthropic (Claude) backend
	- [ ] Batch mode: explain all findings in a scan in one request (reduce API calls)
	- [ ] Store explanations in DB; don't re-explain the same fingerprint twice
	- [ ] Quality feedback button (👍 / 👎) to improve prompt tuning

	### Onboarding

	- [ ] Step-by-step first-run wizard is complete — but needs a "skip and seed demo data" button
	- [ ] Demo scan against a known-vulnerable HF space for new users

	---

	## Long-term (v6.0) 🔮

	### ML-powered triage

	- [ ] ML model trained on triage decisions to auto-suggest status
	- [ ] Anomaly detection: flag repos whose risk score changes sharply between scans
	- [ ] Cluster similar findings (same rule, same file pattern) across all targets

	### Policy engine

	- [ ] Define organisational policies (e.g. "no ERROR findings in production spaces")
	- [ ] Block HF Space deployment if policy violations found (via HF Spaces API)
	- [ ] Policy-as-code: YAML-defined rules stored in the repo

	### Integrations

	- [ ] Slack / Teams alert webhook on high-severity findings
	- [ ] Jira / Linear ticket creation from findings
	- [ ] OPA (Open Policy Agent) for fine-grained authorization rules
	- [ ] SCIM / SSO (Okta, Azure AD) for enterprise deployments

	### Distributed scanning

	- [ ] Agent model: lightweight scanner agents deployed close to target repos
	- [ ] Central SENTINEL server aggregates results from multiple agents
	- [ ] Support GitHub, GitLab, Bitbucket repos (not only HuggingFace)

	---

	## Technical debt 🧹

	\| Item \| Severity \| Notes \|
	\|------\|----------\|-------\|
	\| `user_id=1` hardcoded throughout sentinel/ \| High \| Blocks multi-user \|
	\| `sentinel/services/scanner.py` test coverage at 22% \| High \| Core async worker needs deep async mock tests \|
	\| `sentinel/routes/scan.py` test coverage at 36% \| High \| SSE + PDF export + triage routes uncovered \|
	\| `sentinel/services/ai_explain.py` test coverage at 26% \| Medium \| Mock LLM client tests needed \|
	\| `sentinel/jobs/scheduler.py` test coverage at 44% \| Medium \| Scheduler logic needs async mock tests \|
	\| `sentinel/routes/kb.py` test coverage at 52% \| Medium \| KB CRUD (create/update/delete) untested \|
	\| `sentinel/routes/share.py` test coverage at 50% \| Medium \| Share-view handler body not reached (importlib.reload issue) \|
	\| Coverage tracking note \| — \| `importlib.reload()` in test fixtures prevents pytest-cov from tracking route handler bodies; effective coverage is higher than shown \|
	\| `detect-secrets` JSON format fragile \| Low \| Pin version; upstream API changes \|
	\| E2E Playwright tests require live server \| Low \| Improve fixture isolation \|
	\| `pyproject.toml` and `pytest.ini` both define pytest config \| Low \| Consolidate into `pyproject.toml` \|
	\| Gradio `app.py` is legacy \| Low \| Remove or move to `legacy/` once v5 is confirmed stable \|

	---

	## Version history

	\| Version \| Date \| Highlights \|
	\|---------\|------\|------------\|
	\| v5.0 \| May 2026 \| Sentinel FastAPI app, per-tool selection, html2pdf export, bootstrap binaries \|
	\| v4.0 \| 2025 \| Gradio UI, SARIF output, CLI, Semgrep rule packs, baseline workflow \|
	\| v3.x \| 2025 \| Multi-tool parallel scanning, ThreadPoolExecutor \|
	\| v1–v2 \| 2024 \| Initial single-tool scanner, Bandit only \|