Papers
arxiv:2605.00180

RouteProfile: Elucidating the Design Space of LLM Profiles for Routing

Published on Apr 30
· Submitted by
Tao Feng
on May 15
Authors:
,
,
,
,
,

Abstract

LLM profiling design significantly impacts routing performance, with structured profiles and query-level signals demonstrating superior reliability and generalization compared to flat profiles and domain-level signals.

AI-generated summary

As the large language model (LLM) ecosystem expands, individual models exhibit varying capabilities across queries, benchmarks, and domains, motivating the development of LLM routing. While prior work has largely focused on router mechanism design, LLM profiles, which capture model capabilities, remain underexplored. In this work, we ask: How does LLM profile design affect routing performance across different routers? Addressing this question helps clarify the role of profiles in routing, disentangle profile design from router design, and enable fairer comparison and more principled development of routing systems. To this end, we view LLM profiling as a structured information integration problem over heterogeneous interaction histories. We develop a general design space of LLM profiles, named RouteProfile, along four key dimensions: organizational form, representation type, aggregation depth, and learning configuration. Through systematic evaluation across three representative routers under both standard and new-LLM generalization settings, we show that: (1) structured profiles consistently outperform flat ones; (2) query-level signals are more reliable than coarse domain-level signals; and (3) generalization to newly introduced models benefits most from structured profiles under trainable configurations. Overall, our work highlights LLM profile design as an important direction for future routing research.

Community

Paper submitter

This paper proposes RouteProfile, a general LLM profile design space spanning four key dimensions, and systematically demonstrates that structured, query-level profiles consistently improve routing performance and generalization to newly introduced models.

Sign up or log in to comment

Get this paper in your agent:

hf papers read 2605.00180
Don't have the latest CLI?
curl -LsSf https://hf.co/cli/install.sh | bash

Models citing this paper 0

No model linking this paper

Cite arxiv.org/abs/2605.00180 in a model README.md to link it from this page.

Datasets citing this paper 0

No dataset linking this paper

Cite arxiv.org/abs/2605.00180 in a dataset README.md to link it from this page.

Spaces citing this paper 0

No Space linking this paper

Cite arxiv.org/abs/2605.00180 in a Space README.md to link it from this page.

Collections including this paper 0

No Collection including this paper

Add this paper to a collection to link it from this page.