Commit History

Switch faithfulness to text_pair encoding, promote score logging to INFO
29f3273

mbochniak01 Claude Sonnet 4.6 commited on

Add telemetry layer: in-memory counters + HF Dataset persistence
c79d967

mbochniak01 Claude Sonnet 4.6 commited on

Fix Vectara label check and input format
5935cf6

mbochniak01 Claude Sonnet 4.6 commited on

Add /refresh-cache endpoint, bi-encoder comparison, eval results, Ollama/Prometheus notes
e77a2f2

mbochniak01 Claude Sonnet 4.6 commited on

Add joke short-circuit and reset attempt handler
27156ca

mbochniak01 Claude Sonnet 4.6 commited on

Load T5-small tokenizer for Vectara HHEM v2
14d263b

mbochniak01 Claude Sonnet 4.6 commited on

Use T5Tokenizer directly for Vectara HHEM v2
69c362c

mbochniak01 Claude Sonnet 4.6 commited on

Fix Vectara pipeline: explicitly load tokenizer before pipeline init
86cfc1b

mbochniak01 Claude Sonnet 4.6 commited on

Load Vectara model via transformers pipeline, not CrossEncoder
a42a9e0

mbochniak01 Claude Sonnet 4.6 commited on

Add trust_remote_code=True for Vectara hallucination model
cbb4147

mbochniak01 Claude Sonnet 4.6 commited on

Switch faithfulness grader to Vectara hallucination evaluation model
eb90c62

mbochniak01 Claude Sonnet 4.6 commited on

Exclude pinned glossary doc from faithfulness grading context
cd174a4

below-threshold commited on

Replace enforce_terminology with pinned glossary doc in RAG context
76be5a0

below-threshold commited on

Add enforce_terminology: deterministic post-processing corrective gate
54a5940

below-threshold commited on

Faithfulness: mean sentence scoring, strip chunk title prefix, lower threshold to 0.35
cd30e2d

below-threshold commited on

Pre-build KB indexes at startup, not on first query
aef9f0f

below-threshold commited on

Load all KB formats merged — drop CSV directly, no conversion needed
2a47292

below-threshold commited on

Inject client terminology into system prompt
99649f6

below-threshold commited on

Fix faithfulness: score per chunk, take max entailment
7b3dadd

below-threshold commited on

Switch generation model to Llama-3-8B-Instruct
6e6032f

below-threshold commited on

Replace Anthropic with free-tier stack
ebb06ed

below-threshold commited on

Add full RAG evaluation pipeline with L1 metrics and UI
ebe934f

mbochniak01 commited on