Spaces:
Running
Running
Update README.md
Browse files
README.md
CHANGED
|
@@ -34,9 +34,7 @@ pinned: false
|
|
| 34 |
</a>
|
| 35 |
|
| 36 |
<p class="lg:col-span-3">
|
| 37 |
-
As is, CARROT supports routing to the collection of large language models provided in the table below. Instantiating the CarrotRouter class automatically loads the trained predictors for ouput token count and performance that are hosted in the CARROT-LLM-Router model repositories. Note that you must provide a hugging face token with access to the Llama-3 herd of models.
|
| 38 |
-
|
| 39 |
-
To control your desired cost performance tradeoff, you provide the router with an argument between 0 and 1 for mu; a smaller mu will prioritize perofrmance. Happy routing!
|
| 40 |
|
| 41 |
| | claude-3-5-sonnet-v1 | titan-text-premier-v1 | openai-gpt-4o | openai-gpt-4o-mini | granite-3-2b-instruct | granite-3-8b-instruct | llama-3-1-70b-instruct | llama-3-1-8b-instruct | llama-3-2-1b-instruct | llama-3-2-3b-instruct | llama-3-3-70b-instruct | mixtral-8x7b-instruct | llama-3-405b-instruct |
|
| 42 |
|----------------------|---------------------|----------------------|---------------|--------------------|----------------------|----------------------|----------------------|----------------------|----------------------|----------------------|----------------------|----------------------|----------------------|
|
|
|
|
| 34 |
</a>
|
| 35 |
|
| 36 |
<p class="lg:col-span-3">
|
| 37 |
+
As is, CARROT supports routing to the collection of large language models provided in the table below. Instantiating the CarrotRouter class automatically loads the trained predictors for ouput token count and performance that are hosted in the CARROT-LLM-Router model repositories. Note that you must provide a hugging face token with access to the Llama-3 herd of models. To control your desired cost performance tradeoff, you provide the router with an argument between 0 and 1 for mu; a smaller mu will prioritize perofrmance. Happy routing!
|
|
|
|
|
|
|
| 38 |
|
| 39 |
| | claude-3-5-sonnet-v1 | titan-text-premier-v1 | openai-gpt-4o | openai-gpt-4o-mini | granite-3-2b-instruct | granite-3-8b-instruct | llama-3-1-70b-instruct | llama-3-1-8b-instruct | llama-3-2-1b-instruct | llama-3-2-3b-instruct | llama-3-3-70b-instruct | mixtral-8x7b-instruct | llama-3-405b-instruct |
|
| 40 |
|----------------------|---------------------|----------------------|---------------|--------------------|----------------------|----------------------|----------------------|----------------------|----------------------|----------------------|----------------------|----------------------|----------------------|
|