PeterKruger commited on
Commit
0c2b56d
·
verified ·
1 Parent(s): 511127b

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +1 -1
README.md CHANGED
@@ -12,7 +12,7 @@ license: apache-2.0
12
 
13
  ## Organization Description
14
 
15
- **[AutoBench](https://autobench.org/)** is the premier LLM evaluation and routing infrastructure for the Agentic Era. We are dedicated to solving the LLM evaluation crisis by moving the industry beyond static, domian-rigid, easily gameable text prompts and build the first open LLM-based API Router for the agentic era.
16
 
17
  Pioneering the **"Collective-LLM-as-a-Judge"** methodology, AutoBench uses massive pools of LLMs to dynamically generate tasks, execute multi-turn workflows, and granularly evaluate performance across the AI ecosystem. Today, AutoBench provides fully automated, highly correlated, and strictly un-gameable benchmarking. Furthermore, we leverage the massive synthetic execution datasets generated by our benchmarks to train next-generation **Agentic LLM Routers**, helping agent developers and enterprises optimize for both absolute quality and unit economics.
18
 
 
12
 
13
  ## Organization Description
14
 
15
+ **[AutoBench](https://autobench.org/)** iis the premier LLM evaluation and routing infra for the Agentic Era. We are dedicated to solving the LLM evaluation crisis by moving the industry beyond static, domain-rigid, easily gameable benchmarks and building the next generation LLM-based API routers for the agentic era.
16
 
17
  Pioneering the **"Collective-LLM-as-a-Judge"** methodology, AutoBench uses massive pools of LLMs to dynamically generate tasks, execute multi-turn workflows, and granularly evaluate performance across the AI ecosystem. Today, AutoBench provides fully automated, highly correlated, and strictly un-gameable benchmarking. Furthermore, we leverage the massive synthetic execution datasets generated by our benchmarks to train next-generation **Agentic LLM Routers**, helping agent developers and enterprises optimize for both absolute quality and unit economics.
18