Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Website
Tasks
HuggingChat
Collections
Languages
Organizations
Community
Blog
Posts
Daily Papers
Learn
Discord
Forum
GitHub
Solutions
Team & Enterprise
Hugging Face PRO
Enterprise Support
Inference Providers
Inference Endpoints
Storage Buckets
Log In
Sign Up
Spaces:
syedtaha22
/
substrate
like
0
Sleeping
App
Files
Files
Community
Fetching metadata from the HF Docker repository...
main
substrate
/
eval
1.43 MB
Ctrl+K
Ctrl+K
3 contributors
History:
12 commits
Syed Taha
refactor: extract llm-as-judge logic to reusable module
5839f38
about 1 month ago
results
add script to evaluate rag and evaluation results
about 1 month ago
__init__.py
0 Bytes
add __init__ files for eval and ablation
about 2 months ago
calibrate_test_cases.py
Safe
6.2 kB
add script to calibrate test cases.
about 2 months ago
eval_baseline.py
Safe
10.1 kB
refactor: update eval_baseline.py to integrate Ollama for local LLM calls and improve usage instructions
about 1 month ago
eval_rag.py
Safe
13.7 kB
refactor: extract llm-as-judge logic to reusable module
about 1 month ago
eval_retrieval.py
Safe
15.2 kB
refactor: update eval_retrieval.py to support chunking strategy argument
about 1 month ago
test_queries.yaml
Safe
18.2 kB
fix: correct total query count in test_queries.yaml
about 1 month ago