Fix sentinel edge cases: hallucination combo guard + UI formatting 8d335e4 Running mbochniak01 Claude Sonnet 4.6 commited on May 14
Replace ad-hoc refusal regexes with NOT IN DOCUMENTS sentinel 0ad5e39 mbochniak01 Claude Sonnet 4.6 commited on May 14
Fix refusal detector missing 'not able to' and 'outside scope' phrases 7ee90da mbochniak01 Claude Sonnet 4.6 commited on May 14
Show violation details in flagged response banner e6d6240 mbochniak01 Claude Sonnet 4.6 commited on May 14
Fix Dockerfile: replace HHEM pre-warm with nli-deberta-v3-small 4e97a7e mbochniak01 Claude Sonnet 4.6 commited on May 14
Replace HHEM with sentence-level NLI, add claim decomposition and drift detection ffbf46f mbochniak01 Claude Sonnet 4.6 commited on May 13
Fix compat, bugs, and types; expand retail KB e181667 mbochniak01 Claude Sonnet 4.6 commited on May 11
Switch faithfulness to text_pair encoding, promote score logging to INFO 29f3273 mbochniak01 Claude Sonnet 4.6 commited on May 7
Add telemetry layer: in-memory counters + HF Dataset persistence c79d967 mbochniak01 Claude Sonnet 4.6 commited on May 7
Add /refresh-cache endpoint, bi-encoder comparison, eval results, Ollama/Prometheus notes e77a2f2 mbochniak01 Claude Sonnet 4.6 commited on May 5
Add joke short-circuit and reset attempt handler 27156ca mbochniak01 Claude Sonnet 4.6 commited on May 5
Fix Vectara pipeline: explicitly load tokenizer before pipeline init 86cfc1b mbochniak01 Claude Sonnet 4.6 commited on May 4
Pin transformers<4.46.0 to fix Vectara HHEM v2 compatibility b2eeefb mbochniak01 Claude Sonnet 4.6 commited on May 4
Load Vectara model via transformers pipeline, not CrossEncoder a42a9e0 mbochniak01 Claude Sonnet 4.6 commited on May 4
Speed up Docker builds: .dockerignore + merge model download layers 1ea03d4 mbochniak01 Claude Sonnet 4.6 commited on May 4
Add trust_remote_code=True for Vectara hallucination model cbb4147 mbochniak01 Claude Sonnet 4.6 commited on May 4
Switch faithfulness grader to Vectara hallucination evaluation model eb90c62 mbochniak01 Claude Sonnet 4.6 commited on May 4
Exclude pinned glossary doc from faithfulness grading context cd174a4 below-threshold commited on May 4
Replace enforce_terminology with pinned glossary doc in RAG context 76be5a0 below-threshold commited on May 4
Add enforce_terminology: deterministic post-processing corrective gate 54a5940 below-threshold commited on May 4
Faithfulness: mean sentence scoring, strip chunk title prefix, lower threshold to 0.35 cd30e2d below-threshold commited on May 4
Load all KB formats merged β drop CSV directly, no conversion needed 2a47292 below-threshold commited on May 1
UX: welcome message with example questions, specific failure details in verdict f7a25db below-threshold commited on May 1
Update ARCHITECTURE.md and README.md to reflect client library and test suite 1f6dac5 mbochniak01 commited on May 1
Add typed client library, unit + integration tests, mypy, ruff, NOTES.md 10aced5 mbochniak01 commited on May 1
Add Makefile, HTML eval report generator, gitignore for reports 3c949de mbochniak01 commited on May 1