Cell 22 β Markdown Summary Table (Baseline β Final)
print_summary_table(baseline, final) returns the multi-section markdown
summary that ships in the HF blog and DESIGN.md Β§15 pitch:
- Per-reward (mean + 95% CI) β baseline β final β paired Ξ with CI.
- Per-language β baseline reward_mean β final β Ξ.
- Drift-detection latency β Stage 2/3 p50/p95 before vs after.
- Reward-hacking offenses β per-class baseline β final counts.
Contract: evaluation.md Β§3.3, Β§3.4, Β§3.5; DESIGN.md Β§13 deliverables #6 / #7.
Numeric cells round to 3 decimals (latency to 2). Paired Ξ pulled from
final.breakdown['paired_ci'] (populated by eval_final in step_19).