Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing
    • Website
      • Tasks
      • HuggingChat
      • Collections
      • Languages
      • Organizations
    • Community
      • Blog
      • Posts
      • Daily Papers
      • Learn
      • Discord
      • Forum
      • GitHub
    • Solutions
      • Team & Enterprise
      • Hugging Face PRO
      • Enterprise Support
      • Inference Providers
      • Inference Endpoints
      • Storage Buckets

  • Log In
  • Sign Up

Spaces:
vxa8502
/
Sage
Running

App Files Files Community
Fetching metadata from the HF Docker repository...
Sage / data /eval_results
Ctrl+K
Ctrl+K
  • 1 contributor
History: 3 commits
vxa8502's picture
vxa8502
Calibrate evidence quality gate thresholds
fbc14e7 3 months ago
  • adjusted_faithfulness_20260210_115509.json
    123 Bytes
    Fix eval workflow and align README metrics with eval_results JSON 4 months ago
  • eval_natural_queries_20260210_114459.json
    2.13 kB
    Fix eval workflow and align README metrics with eval_results JSON 4 months ago
  • eval_natural_queries_20260210_114955.json
    463 Bytes
    Fix eval workflow and align README metrics with eval_results JSON 4 months ago
  • eval_natural_queries_20260305_161900_824897.json
    829 Bytes
    Add bootstrap confidence intervals to evaluation metrics 3 months ago
  • failure_analysis_20260210_115508.json
    27.3 kB
    Fix eval workflow and align README metrics with eval_results JSON 4 months ago
  • faithfulness_20260210_115238.json
    1.22 kB
    Fix eval workflow and align README metrics with eval_results JSON 4 months ago
  • grounding_delta_20260210_115418.json
    707 Bytes
    Fix eval workflow and align README metrics with eval_results JSON 4 months ago
  • human_eval_20260210_124705.json
    1.23 kB
    Fix eval workflow and align README metrics with eval_results JSON 4 months ago
  • load_test_20260210_115634.json
    451 Bytes
    Fix eval workflow and align README metrics with eval_results JSON 4 months ago