baseline-v0
Centroid retrieval router over 265 tool names from public agent benchmarks (tau-bench, Hermes, ToolACE). TF-IDF only, no torch, no GPU, no API key. Smallest of the four pretrained baselines.
Quick start
Install the SDK directly from GitHub (PyPI publish pending):
pip install git+https://github.com/dalek-ai/agent-tool-router.git
from agent_tool_router import Router
r = Router.from_pretrained("baseline-v0")
r.route("cancel my pending order and refund the credit", k=3)
Router.from_pretrained("<name>") resolves the bare name to
dalek-ai/<name> on HuggingFace Hub and caches locally on first call.
Numbers
Cross-corpus, held-out 2 041 tasks, vocab = 265 tools (≥ 3 occurrences):
| metric | model | random | ratio |
|---|---|---|---|
| top-1 | 33.0% | 0.38% | 87.5× |
| top-3 | 63.8% | 1.13% | 56.4× |
| top-5 | 83.0% | 1.89% | 44.0× |
For the long-tail catalog (18 671 tools) prefer
baseline-v1-desc or
baseline-v1-desc-hybrid.
Repo & demo
github.com/dalek-ai/agent-tool-router · MIT. Live demo: dalek-ai/agent-tool-router-demo (gradio Space, FR/EN).