Spaces:

CARROT-LLM-Routing
/

README

Running

App Files Files Community

README / README.md

smrstep

Update README.md

44b1dfd verified about 1 year ago

preview code

raw

history blame

2.18 kB

	---
	title: README
	emoji: 📈
	colorFrom: yellow
	colorTo: green
	sdk: static
	pinned: false
	---


	<div class="grid lg:grid-cols-2 gap-x-4 gap-y-7">
	<p class="lg:col-span-3">
	Welcome to CARROT-LLM-Routing! For a given desired trade off between performance and cost,
	CARROT makes it easy to pick the best model among a set of 13 LLMs for any query. Below you may read the CARROT paper, replicate the training process of CARROT, or see how to utilize CARROT out of the box for routing.
	</p>
	<a href="https://arxiv.org/" class="block overflow-hidden group">
	<div
	class="w-40 h-39 object-cover mb-2 rounded-lg flex items-center justify-center bg-[#ECFAFF]"
	>
	<img alt="" src="fmselect_gpt4o_comparison.png" class="w-40" />
	</div>
	<div class="underline">Read the paper</div>
	</a>
	<a
	href="https://github.com/somerstep"
	class="block overflow-hidden"
	>
	<div
	class="w-40 h-39 object-cover mb-2 rounded-lg flex items-center justify-center bg-[#ECFAFF]"
	>
	<img alt="" src="logo.png" class="w-40" />
	</div>
	<div class="underline">Train CARROT</div>
	</a>

	<p class="lg:col-span-3">
	As is, CARROT supports routing to the following collection of large language models.

	\| \| claude-3-5-sonnet-v1 \| titan-text-premier-v1 \| openai-gpt-4o \| openai-gpt-4o-mini \| granite-3-2b-instruct \| granite-3-8b-instruct \| llama-3-1-70b-instruct \| llama-3-1-8b-instruct \| llama-3-2-1b-instruct \| llama-3-2-3b-instruct \| llama-3-3-70b-instruct \| mixtral-8x7b-instruct \| llama-3-405b-instruct \|
	\|----------------------\|---------------------\|----------------------\|---------------\|--------------------\|----------------------\|----------------------\|----------------------\|----------------------\|----------------------\|----------------------\|----------------------\|----------------------\|----------------------\|
	\| Input Token Cost ($ per 1M tokens) \| 3 \| 0.5 \| 2.5 \| 0.15 \| 0.1 \| 0.2 \| 0.9 \| 0.2 \| 0.06 \| 0.06 \| 0.9 \| 0.6 \| 3.5 \|
	\| Output Token Cost ($ per 1M tokens) \| 15 \| 1.5 \| 10 \| 0.6 \| 0.1 \| 0.2 \| 0.9 \| 0.2 \| 0.06 \| 0.06 \| 0.9 \| 0.6 \| 3.5 \|

	</p>

	```python
	your_code = do_some_stuff
	```