Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

Spaces:
Nomearod
/
agentbench
Sleeping

App Files Files Community
Fetching metadata from the HF Docker repository...
agentbench / configs
11.5 kB
Ctrl+K
Ctrl+K
  • 4 contributors
History: 19 commits
Nomearod's picture
Nomearod
calibrate(jury): v1.1+v1.1.1 β€” fix weighting bugs; recency-position paraphrase clause
ab0e054 7 days ago
  • calibration
    calibrate(jury): v1.1+v1.1.1 β€” fix weighting bugs; recency-position paraphrase clause 7 days ago
  • tasks
    feat: Day 1 β€” repo scaffolding, provider abstraction, config, tests about 2 months ago
  • anthropic.yaml
    993 Bytes
    feat: Anthropic Haiku benchmark + README with provider comparison about 2 months ago
  • default.yaml
    3.21 kB
    feat(eval): K8s refusal_threshold sweep against 25Q set β€” 0.015 validated 29 days ago
  • production.yaml
    755 Bytes
    fix: production config with reranker disabled for 512MB free tier about 2 months ago
  • selfhosted_local.yaml
    1.27 kB
    feat: infrastructure sprint β€” vLLM/Modal, Helm, Terraform (#8) about 1 month ago
  • selfhosted_modal.yaml
    1.12 kB
    feat: infrastructure sprint β€” vLLM/Modal, Helm, Terraform (#8) about 1 month ago