baseline-v1-desc

Centroid retrieval router over 18 671 tools, scored by cosine(task_tfidf, description_tfidf). TF-IDF only, ~6 MB, no torch, no GPU. The default model behind pip install agent-tool-router.
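To make the scoring rule concrete, here is a minimal pure-Python sketch of ranking tools by cosine similarity between the TF-IDF vector of a task and the TF-IDF vector of each tool description. It is an illustration only, not the package's implementation: the tokenizer, the smoothed IDF formula, the toy catalog, and every function name below are assumptions.

```python
import math
from collections import Counter

def tokenize(text):
    # Assumed tokenizer: lowercase, whitespace split.
    return text.lower().split()

def build_idf(docs):
    # Smoothed inverse document frequency over the tool descriptions.
    n = len(docs)
    df = Counter()
    for doc in docs:
        df.update(set(tokenize(doc)))
    return {term: math.log((1 + n) / (1 + c)) + 1.0 for term, c in df.items()}

def tfidf(text, idf):
    # L2-normalised sparse TF-IDF vector; terms outside the vocabulary are dropped.
    counts = Counter(t for t in tokenize(text) if t in idf)
    vec = {t: c * idf[t] for t, c in counts.items()}
    norm = math.sqrt(sum(w * w for w in vec.values()))
    return {t: w / norm for t, w in vec.items()} if norm else {}

def cosine(a, b):
    # Both vectors are unit length, so the dot product is the cosine.
    return sum(w * b.get(t, 0.0) for t, w in a.items())

def route(task, tools, k=3):
    # Rank tool names by cosine(task_tfidf, description_tfidf), best first.
    idf = build_idf(list(tools.values()))
    tool_vecs = {name: tfidf(desc, idf) for name, desc in tools.items()}
    query = tfidf(task, idf)
    ranked = sorted(tool_vecs, key=lambda name: cosine(query, tool_vecs[name]), reverse=True)
    return ranked[:k]

# Hypothetical three-tool catalog, for illustration only.
TOOLS = {
    "cancel_pending_order": "cancel a pending order",
    "get_weather": "get the current weather forecast for a city",
    "refund_order": "refund the credit for an order",
}

top = route("cancel my pending order", TOOLS, k=2)
print(top)
```

A purely lexical score like this rewards vocabulary overlap between the task and the description, which is one plausible reason results can vary widely between sources.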

Quick start

Install the SDK directly from GitHub (PyPI publish pending):

pip install git+https://github.com/dalek-ai/agent-tool-router.git

from agent_tool_router import Router
r = Router.from_pretrained("baseline-v1-desc")
r.route("cancel my pending order and refund the credit", k=3)
# ['refundOrder', 'modify_pending_order_items', 'cancel_pending_order']

Numbers

Per-call top-3 against the full 18 671-tool catalog (n=30 425 calls, held-out across the corpus). Random baseline = 3/V = 0.016%:

| source | n calls | top-3 |
|---|---|---|
| Hermes function-calling-v1 | 4 376 | 74.3% |
| ToolACE | 17 169 | 52.4% |
| tau-bench | 8 880 | 3.2% |
| overall | 30 425 | 41.2% |
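As a quick arithmetic check, the sketch below recomputes the random baseline and the overall figure from the per-source rows. It assumes the overall top-3 is the call-weighted mean of the per-source rates, which the reported numbers are consistent with; the variable names are illustrative.

```python
V = 18671  # catalog size
K = 3      # tools returned per call

# Chance that the correct tool is among K distinct tools drawn uniformly at random.
random_top3 = K / V

# (n calls, top-3 accuracy in percent) per source, copied from the reported results.
rows = {
    "Hermes function-calling-v1": (4376, 74.3),
    "ToolACE": (17169, 52.4),
    "tau-bench": (8880, 3.2),
}

total_calls = sum(n for n, _ in rows.values())
overall = sum(n * acc for n, acc in rows.values()) / total_calls

print(f"random baseline: {random_top3:.3%}")
print(f"total calls: {total_calls}, overall top-3: {overall:.1f}%")
```

Because ToolACE contributes more than half of the calls, the overall number tracks that source most closely.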

For Pareto-better top-3 across all three sources at the cost of ~250 MB of torch + sentence-transformers, switch to baseline-v1-desc-hybrid.

Repo & demo

github.com/dalek-ai/agent-tool-router · MIT. Live demo: dalek-ai/agent-tool-router-demo (gradio Space, FR/EN).
