Spaces:

katanemo
/

README

Running

File size: 2,203 Bytes

b4e766d
 
 
 
 
 
 
 
 
2a85d35
92dcf32
2a85d35
1e38b3f
2a85d35
92dcf32
 
 
53f77a4
1e38b3f
211cce3
92dcf32
 
 
211cce3
 
92dcf32
211cce3

---
title: README
emoji: ⚡
colorFrom: green
colorTo: blue
sdk: static
pinned: false
---

The Plano & Arch Family. 

The Plano & Arch family of LLMs are designed to fast and efficient LLMs for common scenarios in agentic application worloads - helping developers stay focused on higher level objectives
of their agents. These scenario include fast agent routing and hand-off, tools calls for common agentic scenarios to improve speed, guadrails and input/output validation of prompts and dynamic routing to LLM based on human preferences. 
The Arch family of LLMs power the intelligence for [Plano](https://github.com/katanemo/plano) (The models-native proxy server and data plane for agents). 

Current

1. Plano-Orchestrator is a family of state-of-the-art routing and orchestration models that decide which agent(s) or LLM(s) should handle each request, and in what sequence. Built for multi-agent orchestration systems, Plano-Orchestrator excels at analyzing user intent and conversation context to make precise routing and orchestration decisions. 
2. Arch-Router: A fast preference-aligned routing model that guides LLM selection by matching queries to user-defined domains (e.g., travel) or action types (e.g., image editing) – offers a practical mechanism to encode preferences in routing decision.
3. Arch-Agent: Designed to power sophisticated multi-step and multi-turn workflows, Arch-Agent excels at handling complex, multi-step tasks that require intelligent tool selection, adaptive planning, and seamless integration with external APIs and services.

History

2. Arch-Function-Chat: A state-of-the-art (SOTA) function calling model also trained to chat - especially useful in scenarios where the model must clarify and refine inputs from the user,
accurately deterime user's downstream intent, and manage decision making in long-form context and complext user interactions. Achieving performance on par with GPT-4. 
1. Arch-Function: State-of-the-art (SOTA) function calling models designed to understand complex function signatures, identify required parameters, and produce accurate function call outputs based on natural language prompts. 
Achieving performance on par with GPT-4.