Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
Spaces:
below-threshold
/
ai-response-validator
like
0
Running
App
Files
Files
Community
Fetching metadata from the HF Docker repository...
main
ai-response-validator
/
eval
Ctrl+K
Ctrl+K
3 contributors
History:
5 commits
below-threshold
Faithfulness: mean sentence scoring, strip chunk title prefix, lower threshold to 0.35
cd30e2d
3 days ago
bot-answers.json
Safe
13.3 kB
Faithfulness: mean sentence scoring, strip chunk title prefix, lower threshold to 0.35
3 days ago
calibrate.py
Safe
2.89 kB
Address Gate 5 audit gaps
6 days ago
golden-dataset.yaml
Safe
17.9 kB
Address Gate 5 audit gaps
6 days ago
metrics.py
Safe
14.2 kB
Add Makefile, HTML eval report generator, gitignore for reports
6 days ago