Commit History

Improve validation and resource handling
26ca7c1

vxa8502 committed on

Add Qdrant retry logic
a68e765

vxa8502 committed on

Add percentile comment
5c74b4d

vxa8502 committed on

Replace manual dict construction
daa087a

vxa8502 committed on

Rename citation fields, strengthen test assertions from loose bounds
8f9ddf0

vxa8502 committed on

Add multi-failure test for evidence quality gate
45581bd

vxa8502 committed on

Simplified cold-start, API takes query directly
b355bb6

vxa8502 committed on

Use canonical is_refusal() from faithfulness module
ff5a9a1

vxa8502 committed on

Replace custom percentile with np.percentile
1ea55b9

vxa8502 committed on

Add integrity verification to cache thread safety test
2ca58b5

vxa8502 committed on

Use MagicMock directly as default_factory for simple cases
a96b9bf

vxa8502 committed on

Extract magic numbers to constants incl other polishing
de28225

vxa8502 committed on

Standardize metrics API to use seconds consistently
42696b2

vxa8502 committed on

Harden middleware shutdown handling and security headers
4145799

vxa8502 committed on

Fix graceful shutdown resource handling
99746a0

vxa8502 committed on

Harden summary.py
78a6ff2

vxa8502 committed on

Calibrate evidence quality gate thresholds
fbc14e7

vxa8502 committed on

Improve metric reporting
adbdbc8

vxa8502 committed on

Add mixed evidence handling to explanation prompts
6eebc10

vxa8502 committed on

Add bootstrap confidence intervals to evaluation metrics
ca96fbf

vxa8502 committed on

Refactor sanity checks with improved adversarial detection
a9bab1a

vxa8502 committed on

Replace direct file writes to *_latest.json with atomic symlink pattern
b799e56

vxa8502 committed on

Add retry logic for transient LLM failures
7707ed9

vxa8502 committed on

Add query sanitization for injection defense
7dd0820

vxa8502 committed on

Split CI into quick and full modes
1bb3f41

vxa8502 committed on

Track rated_count incrementally
2d8c1dc

vxa8502 committed on

Add test coverage for shutdown, lifespan, and cache paths
2ebf3a4

vxa8502 committed on

Add partial result indicator and explanation timeout metrics
ac6a1f1

vxa8502 committed on

Expand /recommend/stream test coverage
3276051

vxa8502 committed on

Fix LLM retry timing metrics
8cc6772

vxa8502 committed on

Test POST /recommend/stream SSE event flow
1d44c69

vxa8502 committed on

Consolidate citation verification
9197e72

vxa8502 committed on

Local make ci now matches GitHub CI
4cdd0ab

vxa8502 committed on

Clean up comments
676a738

vxa8502 committed on

Add structured logging
a76cf24

vxa8502 committed on

Add concurrent load testing for SemanticCache
8a319ac

vxa8502 committed on

Add singleton pattern to Qdrant client
6fcca43

vxa8502 committed on

Add RefusalType enum and query sanitization
790d141

vxa8502 committed on

Add punctuation normalization
7a112a4

vxa8502 committed on

Extract Qdrant pagination helper
e1502f6

vxa8502 committed on

Add boundary test for evidence quality gate
a2ac418

vxa8502 committed on

Improve low-relevance refusal message
05f5de3

vxa8502 committed on

Remove unused verify_explanation_full
7ac43cc

vxa8502 committed on

Fix docstring
f79f258

vxa8502 committed on

Fix precision_at_k
b10301c

vxa8502 committed on

Normalize provider string input
665e3a0

vxa8502 committed on

Add empty list validation
81efce5

vxa8502 committed on

Add explicit pydantic dependency
3d9b12e

vxa8502 committed on

Reorder index creation before upload
7f35c0f

vxa8502 committed on

Fix upload_chunks single-pass batching
875ff44

vxa8502 committed on