mediastorm / eval_retrieval.py

Commit History

feat(eval): add --pipeline flag for full pipeline eval (retriever + Gemini filter)
69e1201

remdms Claude Opus 4.6 commited on

feat(eval): include expected IDs and missed UIDs in eval results
51fa501

remdms Claude Sonnet 4.6 commited on

feat(eval): add quiet mode to eval_retrieval to avoid duplicate output
6d0ebc3

remdms Claude Sonnet 4.6 commited on

feat: add benchmark.py with full performance baseline
e8e3c0f

remdms Claude Opus 4.6 commited on

refactor: remove Reranker references from all entrypoint files
d43a5d9

remdms Claude Sonnet 4.6 commited on

feat: retrieval evaluation script with router improvements
9d271a0

remdms Claude Opus 4.6 commited on