feat(goldens): add source_snippets to 8 FastAPI calibration items a48afb9 Nomearod Claude Opus 4.7 (1M context) commited on 15 days ago
feat(calibration): 30-item stratified calibration_v1 sample 8ef480a Nomearod Claude Opus 4.7 (1M context) commited on 15 days ago
feat(eval): Week 1 step 5 β 25-question K8s golden dataset + grounded_refusal fix 4454894 Nomearod Claude Opus 4.6 (1M context) commited on Apr 14
feat: K8s pilot corpus β 8 pages + config entry + JSON rewrite ce7247c Nomearod Claude Opus 4.6 (1M context) commited on Apr 13
feat: add 6-question K8s golden pilot dataset 3484214 Nomearod Claude Opus 4.6 (1M context) commited on Apr 13
fix: grounded refusal checks no-sources, reference_answer for judge, mock disclaimer 520796c Nomearod Claude Opus 4.6 (1M context) commited on Mar 24
feat: Day 7 β evaluation harness, metrics, report, expanded golden dataset c378584 Nomearod Claude Opus 4.6 (1M context) commited on Mar 24
feat: Day 4 β corpus, ingest script, first 10 golden questions a152b95 Nomearod Claude Opus 4.6 (1M context) commited on Mar 24