Instructions for using gorilla-llm/gorilla-openfunctions-v2 with libraries, inference providers, notebooks, and local apps.

Libraries

How to use gorilla-llm/gorilla-openfunctions-v2 with Transformers:

# Use a pipeline as a high-level helper
from transformers import pipeline

pipe = pipeline("text-generation", model="gorilla-llm/gorilla-openfunctions-v2")
messages = [
    {"role": "user", "content": "Who are you?"},
]
pipe(messages)

# Load model directly
from transformers import AutoTokenizer, AutoModelForCausalLM

tokenizer = AutoTokenizer.from_pretrained("gorilla-llm/gorilla-openfunctions-v2")
model = AutoModelForCausalLM.from_pretrained("gorilla-llm/gorilla-openfunctions-v2")
messages = [
    {"role": "user", "content": "Who are you?"},
]
inputs = tokenizer.apply_chat_template(
    messages,
    add_generation_prompt=True,
    tokenize=True,
    return_dict=True,
    return_tensors="pt",
).to(model.device)

outputs = model.generate(**inputs, max_new_tokens=40)
print(tokenizer.decode(outputs[0][inputs["input_ids"].shape[-1]:]))
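
The slice in the final `print` call is worth a note: for a causal language model, `generate` returns the prompt tokens followed by the newly generated ones, so indexing from the prompt length onward keeps only the reply. A minimal sketch of that slicing with plain Python lists; the helper name `strip_prompt` and the token IDs are made up for illustration and are not part of Transformers:

```python
def strip_prompt(output_ids: list, prompt_len: int) -> list:
    """Return only the tokens that follow the first `prompt_len` positions."""
    return output_ids[prompt_len:]


# Made-up token IDs, standing in for inputs["input_ids"][0] and outputs[0].
prompt_ids = [101, 7592, 2088]
output_ids = prompt_ids + [2023, 2003, 102]

new_tokens = strip_prompt(output_ids, len(prompt_ids))
print(new_tokens)  # [2023, 2003, 102]
```

With real tensors the same idea is written as `outputs[0][inputs["input_ids"].shape[-1]:]`, as in the snippet above.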

Local Apps

vLLM

How to use gorilla-llm/gorilla-openfunctions-v2 with vLLM:

Install from pip and serve model

# Install vLLM from pip:
pip install vllm
# Start the vLLM server:
vllm serve "gorilla-llm/gorilla-openfunctions-v2"
# Call the server using curl (OpenAI-compatible API).
# `vllm serve` keeps running in the foreground, so run this from a second terminal:
curl -X POST "http://localhost:8000/v1/chat/completions" \
    -H "Content-Type: application/json" \
    --data '{
        "model": "gorilla-llm/gorilla-openfunctions-v2",
        "messages": [
            {
                "role": "user",
                "content": "What is the capital of France?"
            }
        ]
    }'
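
The same request can be issued from Python. The following is a minimal sketch using only the standard library. It assumes the vLLM server from the commands above is listening on `localhost:8000`; the helper names `build_chat_request` and `send_chat` are hypothetical, chosen for this example.

```python
import json
import urllib.request

# Values copied from the curl example; adjust them if your server differs.
BASE_URL = "http://localhost:8000/v1"
MODEL_ID = "gorilla-llm/gorilla-openfunctions-v2"


def build_chat_request(prompt: str, base_url: str = BASE_URL,
                       model: str = MODEL_ID) -> urllib.request.Request:
    """Build the same POST request that the curl command sends."""
    payload = {
        "model": model,
        "messages": [{"role": "user", "content": prompt}],
    }
    return urllib.request.Request(
        url=f"{base_url}/chat/completions",
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def send_chat(prompt: str, timeout: float = 60.0) -> dict:
    """Send the request and return the decoded JSON body.

    Requires a running server; raises urllib.error.URLError otherwise.
    """
    request = build_chat_request(prompt)
    with urllib.request.urlopen(request, timeout=timeout) as resp:
        return json.loads(resp.read().decode("utf-8"))
```

With the server up, `send_chat("What is the capital of France?")` should return the parsed JSON response as a dictionary. Passing a different `base_url` to `build_chat_request` lets the same helper target another OpenAI-compatible endpoint.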

Use Docker

docker model run hf.co/gorilla-llm/gorilla-openfunctions-v2

SGLang

How to use gorilla-llm/gorilla-openfunctions-v2 with SGLang:

Install from pip and serve model

# Install SGLang from pip:
pip install sglang
# Start the SGLang server:
python3 -m sglang.launch_server \
    --model-path "gorilla-llm/gorilla-openfunctions-v2" \
    --host 0.0.0.0 \
    --port 30000
# Call the server using curl (OpenAI-compatible API).
# The launch command keeps running in the foreground, so run this from a second terminal:
curl -X POST "http://localhost:30000/v1/chat/completions" \
    -H "Content-Type: application/json" \
    --data '{
        "model": "gorilla-llm/gorilla-openfunctions-v2",
        "messages": [
            {
                "role": "user",
                "content": "What is the capital of France?"
            }
        ]
    }'

Use Docker images

docker run --gpus all \
    --shm-size 32g \
    -p 30000:30000 \
    -v ~/.cache/huggingface:/root/.cache/huggingface \
    --env "HF_TOKEN=<secret>" \
    --ipc=host \
    lmsysorg/sglang:latest \
    python3 -m sglang.launch_server \
        --model-path "gorilla-llm/gorilla-openfunctions-v2" \
        --host 0.0.0.0 \
        --port 30000
# Call the server using curl (OpenAI-compatible API).
# The container runs in the foreground, so run this from a second terminal:
curl -X POST "http://localhost:30000/v1/chat/completions" \
    -H "Content-Type: application/json" \
    --data '{
        "model": "gorilla-llm/gorilla-openfunctions-v2",
        "messages": [
            {
                "role": "user",
                "content": "What is the capital of France?"
            }
        ]
    }'
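
Because the endpoint is OpenAI-compatible, the reply text sits under `choices[0].message.content` in the JSON response. A minimal sketch of pulling it out follows. The helper name `extract_reply` is hypothetical, and the `sample` dictionary is a hand-written placeholder in the OpenAI chat-completion shape, not real output from this model.

```python
from typing import Any, Dict


def extract_reply(response: Dict[str, Any]) -> str:
    """Return the assistant text from an OpenAI-style chat completion."""
    choices = response.get("choices") or []
    if not choices:
        raise ValueError("response contains no choices")
    content = choices[0].get("message", {}).get("content")
    if content is None:
        raise ValueError("first choice has no message content")
    return content


# Hand-written placeholder response (illustrative only, not model output).
sample = {
    "object": "chat.completion",
    "model": "gorilla-llm/gorilla-openfunctions-v2",
    "choices": [
        {
            "index": 0,
            "message": {"role": "assistant", "content": "placeholder reply"},
            "finish_reason": "stop",
        }
    ],
}

print(extract_reply(sample))  # placeholder reply
```

Raising on a missing `choices` list surfaces error payloads early instead of failing later with a `KeyError`.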

Docker Model Runner
How to use gorilla-llm/gorilla-openfunctions-v2 with Docker Model Runner:
```
docker model run hf.co/gorilla-llm/gorilla-openfunctions-v2
```

gorilla-openfunctions-v2

Commit History

Update tokenizer_config.json, correcting DeepSeek to Gorilla LLM (1f6ac3b, verified): CharlieJi committed on Apr 18, 2024

Update README.md (0f91d70, verified): CharlieJi committed on Mar 10, 2024

Update README.md (a37bc6d, verified): CharlieJi committed on Mar 9, 2024

Update README.md (eedf71a, verified): CharlieJi committed on Mar 9, 2024

Update README.md (436f19e, verified): CharlieJi committed on Mar 9, 2024

Update README.md (0383bee, verified): CharlieJi committed on Mar 8, 2024

Update README.md (040ab93, verified): CharlieJi committed on Mar 8, 2024

Update README.md (1deaa30, verified): CharlieJi committed on Mar 8, 2024

Update README with the local inference update (bb0fe27, verified): shishirpatil committed on Mar 6, 2024

Update README.md (3fd971c, verified): CharlieJi committed on Mar 4, 2024

Upload 2 files (ceff6f1, verified): CharlieJi committed on Mar 4, 2024

fixed README \n issue (21354e3): CharlieJi committed on Mar 3, 2024

update README (37d1cd9): CharlieJi committed on Mar 3, 2024

[add] README instruction on file dependencies, linked to github (68a6ac1): CharlieJi committed on Mar 3, 2024

Update README.md (673a424, verified): tianjunz committed on Mar 1, 2024

model upload (6225f84): Shishir Patil committed on Feb 26, 2024

initial commit (0302bf4, verified): shishirpatil committed on Feb 26, 2024
