Cluster, Route, Escalate: Cascaded Framework for Cost-Aware LLM Serving Paper • 2606.27457 • Published 6 days ago • 3
Cluster, Route, Escalate: Cascaded Framework for Cost-Aware LLM Serving Paper • 2606.27457 • Published 6 days ago • 3
Dynamic Model Routing and Cascading for Efficient LLM Inference: A Survey Paper • 2603.04445 • Published Apr 21 • 5
Dynamic Model Routing and Cascading for Efficient LLM Inference: A Survey Paper • 2603.04445 • Published Apr 21 • 5
AfriNLLB: Efficient Translation Models for African Languages Paper • 2602.09373 • Published Feb 10 • 3
AfriNLLB Collection AfriNLLB: Efficient Translation Models for African Languages • 11 items • Updated Feb 15 • 5