Spaces:

rohitsar567
/

InsuranceBot

Sleeping

App Files Files Community

InsuranceBot / audit /README.md

rohitsar567

feat(audit): selftest in pytest gate (static-scoped, no recursion) + opt-in git hooks + README

22a03fc about 2 months ago

preview code

Raw

History Blame Contribute Delete

3.8 kB

`audit/` — the repo's self-verifying error/risk gate

One runnable command that asserts the repo is sound. 24 checks across 5 tiers. Every check traces to a real incident this project hit (LFS quota silent failures, deleted-module import breakage, ChromaDB disk bloat, etc.) — the auditor exists so those classes of failure cannot ship silently again.

The auditor itself is verified: --selftest reconstructs the broken state of each incident in a temp fixture and asserts the matching check still flags it. A check that no longer detects its incident is reported as "not self-verifying".

CLI

tools/audit.sh --static|--build|--functional|--deploy|--all|--selftest [--json]

tools/audit.sh just execs .venv/bin/python -m audit "$@" from the repo root. (Note: tools/audit.sh is the entrypoint — unrelated to the pre-existing tools/audit/ tool directory.)

--json emits the per-check results as a JSON array instead of the text report (id / status / evidence / remediation).

Tiers

Flag	Runs	Checks	When to run
`--static`	static	T1.* + T2.* (11)	Pre-commit. Seconds. Pure repo/AST/import/lint — no build, no network, no servers.
`--build`	static + build	+ T3.* (14)	Before push. Runs the full `pytest` gate, `next build` (production static export), and boots the backend for `/api/health`. Minutes.
`--functional`	static + build + functional	+ T4.* (20)	Needs a local backend on :8000 and frontend up. API smoke (health/coverage/chat/upload/profile) + Playwright E2E journeys.
`--deploy`	deploy only	T5.* (4)	Read-only prod/deploy safety: LFS pre-push simulation, Dockerfile coherence, deployed-SHA vs local, standing tripwires.
`--all`	static + build + functional + deploy	all 24	Everything. Run with backend (and ideally frontend) up.

Exit codes

Exit is non-zero iff any check is FAIL.
WARN never fails the gate (it surfaces a soft / deferred condition, e.g. T5.3 deferred-deploy SHA mismatch or T2.3 a stale-doc note).
SKIP (e.g. a prerequisite service is down) also does not fail the gate.

`--selftest`

.venv/bin/python -m audit --selftest        # full: 24 checks

For every check, a fixture stands up the broken state from the original incident; the check must return its selftest_expect status on that fixture (FAIL for most; WARN for T5.3 / T5.4 which are advisory by design). Output: 24 checks · 0 not self-verifying. A non-zero "not self-verifying" count means the auditor has silently rotted and is no longer catching its own incident.

The pytest gate (tests/test_audit_selftest.py) runs the selftest scoped to the static tier only — core.selftest(only_tiers={"static"}). The build/functional/deploy tiers are deliberately excluded from the unit gate: T3.1's fixture shells pytest, which would recurse into that very test; T3.2 runs a multi-minute next build; Tier 4 needs a live backend. The full 24-check selftest is run via tools/audit.sh --selftest (manual / pre-push).

Opt-in git hooks

.githooks/ ships a pre-commit (--static) and a pre-push (--build then --deploy). They are opt-in — enable per clone with:

git config core.hooksPath .githooks

This repo does not set that automatically; nothing changes your git config until you run the command above.

Tier 5 is read-only against prod

Every Tier 5 check is strictly read-only with respect to deployed infrastructure: it inspects local artifacts (Dockerfile, LFS attributes, tripwire state) and at most reads the deployed SHA for comparison. It never pushes, deploys, mutates remote state, or touches the HF Space — so --deploy and --all are safe to run at any time.

audit/ — the repo's self-verifying error/risk gate

CLI