Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Website
Tasks
HuggingChat
Collections
Languages
Organizations
Community
Blog
Posts
Daily Papers
Learn
Discord
Forum
GitHub
Solutions
Team & Enterprise
Hugging Face PRO
Enterprise Support
Inference Providers
Inference Endpoints
Storage Buckets
Log In
Sign Up
Spaces:
below-threshold
/
ai-response-validator
like
0
Running
App
Files
Files
Community
Fetching metadata from the HF Docker repository...
main
ai-response-validator
/
eval
66.7 kB
Ctrl+K
Ctrl+K
3 contributors
History:
7 commits
mbochniak01
Replace HHEM with sentence-level NLI, add claim decomposition and drift detection
ffbf46f
about 1 month ago
bot-answers.json
13.3 kB
Faithfulness: mean sentence scoring, strip chunk title prefix, lower threshold to 0.35
about 2 months ago
calibrate.py
2.89 kB
Fix compat, bugs, and types; expand retail KB
about 2 months ago
compare_faithfulness.py
4.15 kB
Replace HHEM with sentence-level NLI, add claim decomposition and drift detection
about 1 month ago
drift.py
6.34 kB
Replace HHEM with sentence-level NLI, add claim decomposition and drift detection
about 1 month ago
golden-dataset.yaml
17.9 kB
Address Gate 5 audit gaps
about 2 months ago
metrics.py
14.1 kB
Fix compat, bugs, and types; expand retail KB
about 2 months ago
simulate_traffic.py
8.01 kB
Replace HHEM with sentence-level NLI, add claim decomposition and drift detection
about 1 month ago