Commit History

feat(goldens): add source_snippets to 8 FastAPI calibration items
a48afb9

Nomearod Claude Opus 4.7 (1M context) committed on

feat(calibration): 30-item stratified calibration_v1 sample
8ef480a

Nomearod Claude Opus 4.7 (1M context) committed on

feat(eval): Week 1 step 5 - 25-question K8s golden dataset + grounded_refusal fix
4454894

Nomearod Claude Opus 4.6 (1M context) committed on

feat: K8s pilot corpus - 8 pages + config entry + JSON rewrite
ce7247c

Nomearod Claude Opus 4.6 (1M context) committed on

feat: add 6-question K8s golden pilot dataset
3484214

Nomearod Claude Opus 4.6 (1M context) committed on

fix: grounded refusal checks no-sources, reference_answer for judge, mock disclaimer
520796c

Nomearod Claude Opus 4.6 (1M context) committed on

feat: Day 7 - evaluation harness, metrics, report, expanded golden dataset
c378584

Nomearod Claude Opus 4.6 (1M context) committed on

feat: Day 4 - corpus, ingest script, first 10 golden questions
a152b95

Nomearod Claude Opus 4.6 (1M context) committed on