# README v3 → v4 — Change Log **Date:** 2026-05-26 **v3:** 653 lines · 6284 words **v4:** 699 lines · 7162 words **Delta:** +46 lines · +878 words (well under the +150-line / 800-line budget) The v4 pass incorporates the overnight 14-sub-agent stress loop findings ([`outputs/MASTER_REPORT.md`](./MASTER_REPORT.md), [`outputs/BUG_BACKLOG.md`](./BUG_BACKLOG.md)) while preserving v3's Medium-quality voice. --- ## What changed ### 1. Badges (top of README) - Tests badge: `149 Py + 30 Sol + 15 FE` → **`219 Py + 36 Jest + 30 Foundry`** (real counts post-overnight) - Slither badge: clarifies "0 High / 0 Medium" applies to **first-party** code (OZ `Math.sol` library noise excluded) - **New Smoke badge:** `Smoke 10/12 GREEN` linking to MASTER_REPORT.md - Tests + Slither badges now point at `MASTER_REPORT.md` instead of `final_audit_summary.md` (more current) ### 2. TL;DR paragraph 3 Added: builder code `0xa934…beb1` registered, Alchemy Polygon RPC bound, overnight stress loop reference, real-stack coverage figure (~85%), smoke 10/12 GREEN. ### 3. Real vs Mock: Honest Accounting - Phase 1 chain glue + dispatch rows: `PHASE 1 (landing)` → **`REAL (smoke verified)`** - New row: Alchemy Polygon RPC binding marked REAL with median latency - Coverage estimate: **25–30% → ~85%** with provenance (Phase 1 + overnight verification) - WARNING callout rewritten: the gap now is BLEU/COMET reference-lookup wiring (HIGH-1 in BUG_BACKLOG), not chain glue. MQM (the most informative of the three) is real. ### 4. The Numbers table Replaced "target 60–75 s" with **measured** values from `perf_benchmark.md`: | Row | v3 | v4 | |-----|-----|-----| | Lifecycle wall clock | 60–75 s target | p50 **65.87 s** measured, p95 ≥180 s on stalls | | API p95 | — | **8.7 – 29.3 ms** (`/events`, `/leaderboard`, `/events/{id}`) | | Backend cold start | — | **1.65 s** + Next.js FCP 90–760 ms | | FAISS lookup median | — | **16.07 ms** vs 100 ms budget | | Arc RPC eth_blockNumber | — | p50 **590.6 ms** · p95 **828.3 ms** | | Test suites total | — | **285 pass** (219 + 36 + 30) | | Slither verdict | post-hardening | post-hardening **first-party** clarified | ### 5. NEW SECTION — "Stress Tested Overnight (2026-05-26, 04:30–08:00 SGT)" **Position:** before "Audit + Hardening Pass" (chronologically the newer event). Covers: - 14 sub-agents launched in 3 waves over 3.5 hours - 600+ check items across 9 domains - 47 bugs catalogued, 27 auto-fixed - Before/after table on 9 surfaces (smoke 4→10/12, mobile 47%→81%, etc.) - 121 screenshots produced - **Demo readiness verdict: GREEN mechanism / YELLOW market** with explanation - Pointers to MASTER_REPORT.md and BUG_BACKLOG.md Voice: "we put it through a stress test loop" — not boastful, just earned. ### 6. Audit + Hardening Pass One-line transition update: "Before the overnight stress loop, an earlier 8-audit parallel pass ran…" — chronology now reads cleanly. ### 7. Roadmap Added a new "Production hardening" phase (1–4 weeks post-ship) capturing the three concrete Agent H production recommendations: - `BackgroundTasks` migration for `/trigger/event` (BLOCKER) - LLM timeout + circuit breaker around the 4-provider fan-out - `gunicorn --workers 4` + reverse proxy - BLEU/COMET reference-lookup wiring (HIGH-1) - Firefox SSE CORS fix (HIGH-2) Plus a follow-up paragraph explaining what each is, what surfaced it (Agent H perf benchmark), and that they are roughly day-of-work fixes, not architecture changes. ### 8. Arc capabilities table Added Alchemy Polygon RPC row with app id `ngx37mo60qae6ror` and median latency. ### 9. "The Numbers" intro polish Opener now lists the concrete corpus + event + bid + submission + test counts that back the "real data, not just simulated" claim. --- ## What stayed the same (deliberate) - 6 Mermaid diagrams (none added, none removed) - All 24 unique §5.X cross-reference anchors still resolve - Section order unchanged except the new "Stress Tested Overnight" insertion - Voice / tone preserved from v3 (Medium-quality narrative) - §5.30 honest-scope discipline maintained: no claim about proof-of-market - Mechanism design defaults table (locked parameters) untouched - License + Closing Thesis untouched --- ## Anchor-resolve verification Unique §5.X anchors in v4 (24 distinct): 50, 502, 503, 510, 515, 518, 521, 522, 527, 528, 530, 540, 5402, 541, 542, 543, 544, 546, 547, 548, 55, 551, 56, 57 All identical to v3 anchor set. No anchors added or dropped.