substrate / eval

Commit History

refactor: extract llm-as-judge logic to reusable module
5839f38

Syed Taha commited on

fix: correct total query count in test_queries.yaml
42117b5

Syed Taha commited on

add script to evaluate rag and evaluation results
7db3d04

Syed Taha commited on

add baseline and retrieval eval results
57f0dc1

Syed Taha commited on

refactor: update eval_baseline.py to integrate Ollama for local LLM calls and improve usage instructions
0d5ef14

Syed Taha commited on

refactor: update eval_retrieval.py to support chunking strategy argument
eb951a8

Syed Taha commited on

add baseline eval script and results
246aed2

syedtaha22 commited on

add __init__ files for eval and ablation
b15cf5c

syedtaha22 commited on

Update test_queries.yaml to version 3.0 and retrieval criteria
48249d6

syedtaha22 commited on

add script to calibrate test cases.
0bc7ef7

syedtaha22 commited on

add test script to evaluate retriaval methodologies. also add script output
d056ae7

syedtaha22 commited on

add test queries
efa0cfa

syedtaha22 commited on