File size: 231 Bytes
f707fd4
 
 
 
 
1
2
3
4
5
6
"""Evaluation harness: compare checkpoints across tiers and print the markdown table.

TODO (Phase 15): implement CLI per PROJECT.md Section 20. Must reproduce within
2% across 50-rollout evaluations (PROJECT.md Section 20.4).
"""