Spaces:
Running
Running
Commit History
fix(judges,calibration,harness): three Codex adversarial-review findings 226b6f4
feat(calibration): six row configs for the ΞΊ ablation table cf57f16
feat(eval): K8s refusal_threshold sweep against 25Q set β 0.015 validated 2d1d822
feat(eval): Week 1 step 5 β 25-question K8s golden dataset + grounded_refusal fix 4454894
chore(eval): pin gpt-4o-mini snapshot + wire fastapi golden_dataset + pre-commit tolerances 5c1f49f
feat(eval): K8s refusal_threshold 0.02 β 0.015 empirical calibration 125dac0
feat: K8s pilot corpus β 8 pages + config entry + JSON rewrite ce7247c
fix: batch-3 adversarial review findings 42c7303
feat: K8s corpus config entry, ingestion target, curation policy 3c0089e
feat(security): add security config models to AppConfig 4717d76
feat: infrastructure sprint β vLLM/Modal, Helm, Terraform (#8) a9d4375
Jane Yeung Claude Opus 4.6 (1M context) commited on