Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
Spaces:
Nomearod
/
agentbench
like
0
Sleeping
App
Files
Files
Community
Fetching metadata from the HF Docker repository...
main
agentbench
/
configs
11.5 kB
Ctrl+K
Ctrl+K
4 contributors
History:
19 commits
Nomearod
calibrate(jury): v1.1+v1.1.1 β fix weighting bugs; recency-position paraphrase clause
ab0e054
7 days ago
calibration
calibrate(jury): v1.1+v1.1.1 β fix weighting bugs; recency-position paraphrase clause
7 days ago
tasks
feat: Day 1 β repo scaffolding, provider abstraction, config, tests
about 2 months ago
anthropic.yaml
Safe
993 Bytes
feat: Anthropic Haiku benchmark + README with provider comparison
about 2 months ago
default.yaml
Safe
3.21 kB
feat(eval): K8s refusal_threshold sweep against 25Q set β 0.015 validated
29 days ago
production.yaml
Safe
755 Bytes
fix: production config with reranker disabled for 512MB free tier
about 2 months ago
selfhosted_local.yaml
Safe
1.27 kB
feat: infrastructure sprint β vLLM/Modal, Helm, Terraform (#8)
about 1 month ago
selfhosted_modal.yaml
Safe
1.12 kB
feat: infrastructure sprint β vLLM/Modal, Helm, Terraform (#8)
about 1 month ago