Spaces: vxa8502/Sage
Sage/data/eval_results (main)
1 contributor
History: 3 commits
vxa8502: Calibrate evidence quality gate thresholds (fbc14e7, 3 months ago)
| File | Scan | Size | Last commit | Updated |
| --- | --- | --- | --- | --- |
| adjusted_faithfulness_20260210_115509.json | Safe | 123 Bytes | Fix eval workflow and align README metrics with eval_results JSON | 4 months ago |
| eval_natural_queries_20260210_114459.json | Safe | 2.13 kB | Fix eval workflow and align README metrics with eval_results JSON | 4 months ago |
| eval_natural_queries_20260210_114955.json | Safe | 463 Bytes | Fix eval workflow and align README metrics with eval_results JSON | 4 months ago |
| eval_natural_queries_20260305_161900_824897.json | Safe | 829 Bytes | Add bootstrap confidence intervals to evaluation metrics | 3 months ago |
| failure_analysis_20260210_115508.json | Safe | 27.3 kB | Fix eval workflow and align README metrics with eval_results JSON | 4 months ago |
| faithfulness_20260210_115238.json | Safe | 1.22 kB | Fix eval workflow and align README metrics with eval_results JSON | 4 months ago |
| grounding_delta_20260210_115418.json | Safe | 707 Bytes | Fix eval workflow and align README metrics with eval_results JSON | 4 months ago |
| human_eval_20260210_124705.json | Safe | 1.23 kB | Fix eval workflow and align README metrics with eval_results JSON | 4 months ago |
| load_test_20260210_115634.json | Safe | 451 Bytes | Fix eval workflow and align README metrics with eval_results JSON | 4 months ago |