Update model card: #1 in Optimal Accuracy Score (88.7%) on RouterArena leaderboard
Browse files
README.md
CHANGED
|
@@ -11,7 +11,6 @@ language:
|
|
| 11 |
- en
|
| 12 |
metrics:
|
| 13 |
- accuracy
|
| 14 |
-
license: apache-2.0
|
| 15 |
---
|
| 16 |
|
| 17 |
# Chayan: Multi-Model LLM Router
|
|
@@ -20,7 +19,18 @@ license: apache-2.0
|
|
| 20 |
|
| 21 |
## Performance
|
| 22 |
|
| 23 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 24 |
- **$0.333 per 1K queries** (estimated cost)
|
| 25 |
- **+7.62pp improvement** over baseline 2-model router
|
| 26 |
- Achieves **99% of theoretical perfect oracle performance**
|
|
@@ -154,16 +164,24 @@ predictions = router.predict(augmented, k=4)
|
|
| 154 |
|
| 155 |
## RouterArena Leaderboard
|
| 156 |
|
| 157 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
| 158 |
|
| 159 |
-
| Rank | Router | Accuracy |
|
| 160 |
-
|
| 161 |
-
| 1 |
|
| 162 |
-
| 2 |
|
| 163 |
-
| 3 |
|
| 164 |
-
|
|
| 165 |
|
| 166 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
| 167 |
|
| 168 |
## Technical Insights
|
| 169 |
|
|
@@ -198,18 +216,22 @@ K-NN classifiers are sensitive to class imbalance. By applying calibration facto
|
|
| 198 |
If you use Chayan in your research or applications, please cite:
|
| 199 |
|
| 200 |
```bibtex
|
| 201 |
-
@software{
|
| 202 |
-
title = {
|
| 203 |
-
author = {
|
| 204 |
year = {2025},
|
| 205 |
-
|
| 206 |
-
|
| 207 |
}
|
| 208 |
```
|
| 209 |
|
|
|
|
|
|
|
|
|
|
|
|
|
| 210 |
## Links
|
| 211 |
|
| 212 |
- **Model Repository**: https://huggingface.co/adaptive-classifier/chayan
|
| 213 |
- **Library**: https://github.com/codelion/adaptive-classifier
|
| 214 |
- **RouterArena**: https://routeworks.github.io/
|
| 215 |
-
- **RouterArena Paper**: https://arxiv.org/abs/2510.00202
|
|
|
|
| 11 |
- en
|
| 12 |
metrics:
|
| 13 |
- accuracy
|
|
|
|
| 14 |
---
|
| 15 |
|
| 16 |
# Chayan: Multi-Model LLM Router
|
|
|
|
| 19 |
|
| 20 |
## Performance
|
| 21 |
|
| 22 |
+
🏆 **#1 on RouterArena Leaderboard in Optimal Accuracy Score**
|
| 23 |
+
|
| 24 |
+
Official RouterArena Full Dataset Results (8,400 queries):
|
| 25 |
+
- **88.7% Optimal Accuracy Score** - 🥇 SOTA! Ranked #1 in this category
|
| 26 |
+
- **64.9% Overall Accuracy** - #1 among open-source routers
|
| 27 |
+
- **Arena Score: 63.8**
|
| 28 |
+
- **$0.60 per 1K queries** - Cost-efficient routing
|
| 29 |
+
|
| 30 |
+
The **Optimal Accuracy Score** measures how often the router makes the right routing decision - when Chayan selects a model for a query, that model provides the correct answer 88.7% of the time.
|
| 31 |
+
|
| 32 |
+
Sub_10 Benchmark (809 queries):
|
| 33 |
+
- **69.05% accuracy**
|
| 34 |
- **$0.333 per 1K queries** (estimated cost)
|
| 35 |
- **+7.62pp improvement** over baseline 2-model router
|
| 36 |
- Achieves **99% of theoretical perfect oracle performance**
|
|
|
|
| 164 |
|
| 165 |
## RouterArena Leaderboard
|
| 166 |
|
| 167 |
+
🏆 **Official Results - #1 in Optimal Accuracy Score Category**
|
| 168 |
+
|
| 169 |
+

|
| 170 |
+
|
| 171 |
+
Chayan on the official [RouterArena leaderboard](https://routeworks.github.io/):
|
| 172 |
|
| 173 |
+
| Rank (Overall) | Router | Arena Score | Accuracy | **Opt. Acc** | Cost/1k | Type |
|
| 174 |
+
|----------------|--------|-------------|----------|--------------|---------|------|
|
| 175 |
+
| 1 | **Chayan** | 63.8 | 64.9% | **88.7%** 🥇 | $0.60 | Open-Source |
|
| 176 |
+
| 2 | RouterBench-MLP | 57.6 | 61.6% | 83.3% | $4.80 | Open-Source |
|
| 177 |
+
| 3 | Azure | 66.7 | 68.1% | 82.0% | $0.50 | Closed-Source |
|
| 178 |
+
| 4 | vLLM-SR | 64.3 | 67.3% | 79.3% | $1.70 | Open-Source |
|
| 179 |
|
| 180 |
+
**🥇 SOTA Achievement - Optimal Accuracy Score Category**: Chayan achieves **88.7% Optimal Accuracy**, ranking **#1** in this critical metric across all routers on the leaderboard.
|
| 181 |
+
|
| 182 |
+
**What is Optimal Accuracy Score?** This metric measures routing decision quality - when Chayan selects a model for a query, that model provides the correct answer 88.7% of the time. This is the highest score among all evaluated routers, demonstrating Chayan's superior model selection capability.
|
| 183 |
+
|
| 184 |
+
View the full leaderboard and PR: [RouterArena PR #24](https://github.com/RouteWorks/RouterArena/pull/24)
|
| 185 |
|
| 186 |
## Technical Insights
|
| 187 |
|
|
|
|
| 216 |
If you use Chayan in your research or applications, please cite:
|
| 217 |
|
| 218 |
```bibtex
|
| 219 |
+
@software{chayan_router_2025,
|
| 220 |
+
title = {Chayan: Calibrated Multi-Model LLM Router},
|
| 221 |
+
author = {Adaptive Classifier Team},
|
| 222 |
year = {2025},
|
| 223 |
+
url = {https://huggingface.co/adaptive-classifier/chayan},
|
| 224 |
+
note = {High-performance LLM router achieving 69.05\% accuracy on RouterArena}
|
| 225 |
}
|
| 226 |
```
|
| 227 |
|
| 228 |
+
## License
|
| 229 |
+
|
| 230 |
+
MIT License
|
| 231 |
+
|
| 232 |
## Links
|
| 233 |
|
| 234 |
- **Model Repository**: https://huggingface.co/adaptive-classifier/chayan
|
| 235 |
- **Library**: https://github.com/codelion/adaptive-classifier
|
| 236 |
- **RouterArena**: https://routeworks.github.io/
|
| 237 |
+
- **RouterArena Paper**: https://arxiv.org/abs/2510.00202
|