Instructions to use Intel/llava-gemma-2b with libraries, inference providers, notebooks, and local apps. Follow these links to get started.

Libraries

How to use Intel/llava-gemma-2b with Transformers:

# Use a pipeline as a high-level helper
from transformers import pipeline

pipe = pipeline("image-text-to-text", model="Intel/llava-gemma-2b")
messages = [
    {
        "role": "user",
        "content": [
            {"type": "image", "url": "https://huggingface.co/datasets/huggingface/documentation-images/resolve/main/p-blog/candy.JPG"},
            {"type": "text", "text": "What animal is on the candy?"}
        ]
    },
]
pipe(text=messages)

# Load model directly
from transformers import AutoProcessor, AutoModelForImageTextToText

processor = AutoProcessor.from_pretrained("Intel/llava-gemma-2b")
model = AutoModelForImageTextToText.from_pretrained("Intel/llava-gemma-2b")
messages = [
    {
        "role": "user",
        "content": [
            {"type": "image", "url": "https://huggingface.co/datasets/huggingface/documentation-images/resolve/main/p-blog/candy.JPG"},
            {"type": "text", "text": "What animal is on the candy?"}
        ]
    },
]
inputs = processor.apply_chat_template(
	messages,
	add_generation_prompt=True,
	tokenize=True,
	return_dict=True,
	return_tensors="pt",
).to(model.device)

outputs = model.generate(**inputs, max_new_tokens=40)
print(processor.decode(outputs[0][inputs["input_ids"].shape[-1]:]))

Notebooks
Google Colab
Kaggle
Local Apps

vLLM

How to use Intel/llava-gemma-2b with vLLM:

Install from pip and serve model

# Install vLLM from pip:
pip install vllm
# Start the vLLM server:
vllm serve "Intel/llava-gemma-2b"
# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:8000/v1/chat/completions" \
	-H "Content-Type: application/json" \
	--data '{
		"model": "Intel/llava-gemma-2b",
		"messages": [
			{
				"role": "user",
				"content": [
					{
						"type": "text",
						"text": "Describe this image in one sentence."
					},
					{
						"type": "image_url",
						"image_url": {
							"url": "https://cdn.britannica.com/61/93061-050-99147DCE/Statue-of-Liberty-Island-New-York-Bay.jpg"
						}
					}
				]
			}
		]
	}'

Use Docker

docker model run hf.co/Intel/llava-gemma-2b

SGLang

How to use Intel/llava-gemma-2b with SGLang:

Install from pip and serve model

# Install SGLang from pip:
pip install sglang
# Start the SGLang server:
python3 -m sglang.launch_server \
    --model-path "Intel/llava-gemma-2b" \
    --host 0.0.0.0 \
    --port 30000
# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:30000/v1/chat/completions" \
	-H "Content-Type: application/json" \
	--data '{
		"model": "Intel/llava-gemma-2b",
		"messages": [
			{
				"role": "user",
				"content": [
					{
						"type": "text",
						"text": "Describe this image in one sentence."
					},
					{
						"type": "image_url",
						"image_url": {
							"url": "https://cdn.britannica.com/61/93061-050-99147DCE/Statue-of-Liberty-Island-New-York-Bay.jpg"
						}
					}
				]
			}
		]
	}'

Use Docker images

docker run --gpus all \
    --shm-size 32g \
    -p 30000:30000 \
    -v ~/.cache/huggingface:/root/.cache/huggingface \
    --env "HF_TOKEN=<secret>" \
    --ipc=host \
    lmsysorg/sglang:latest \
    python3 -m sglang.launch_server \
        --model-path "Intel/llava-gemma-2b" \
        --host 0.0.0.0 \
        --port 30000
# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:30000/v1/chat/completions" \
	-H "Content-Type: application/json" \
	--data '{
		"model": "Intel/llava-gemma-2b",
		"messages": [
			{
				"role": "user",
				"content": [
					{
						"type": "text",
						"text": "Describe this image in one sentence."
					},
					{
						"type": "image_url",
						"image_url": {
							"url": "https://cdn.britannica.com/61/93061-050-99147DCE/Statue-of-Liberty-Island-New-York-Bay.jpg"
						}
					}
				]
			}
		]
	}'

Docker Model Runner
How to use Intel/llava-gemma-2b with Docker Model Runner:
```
docker model run hf.co/Intel/llava-gemma-2b
```

llava-gemma-2b

Commit History

Update README.md

dca66d9
verified

PerRing commited on Jul 30, 2024

removed warnings/deprecation errors

d127740
verified

matthewlyleolson commited on Jun 11, 2024

Update README.md

73bc006
verified

matthewlyleolson commited on Jun 10, 2024

Update README.md

b5674bf
verified

matthewlyleolson commited on Jun 10, 2024

Update LICENSE.md

d51c5d1
verified

bconsolvo commited on Jun 4, 2024

Update README.md

8063bbb
verified

bconsolvo commited on Jun 4, 2024

Update LICENSE.md

a3f0eca
verified

bconsolvo commited on Jun 4, 2024

Upload LICENSE.md

c6964a4
verified

bconsolvo commited on Jun 4, 2024

Update README.md

5cde804
verified

bconsolvo commited on May 31, 2024

Update README.md

50a4edc
verified

bconsolvo commited on May 31, 2024

Update README.md

295ac35
verified

bconsolvo commited on May 31, 2024

Update README.md

8ec8773
verified

matthewlyleolson commited on May 15, 2024

fix bug in conversation

53252a4
verified

matthewlyleolson commited on Apr 17, 2024

Remove device args in example code

4b76d8b
verified

musashihinck commited on Apr 9, 2024

Fix links in markdown & additional metadata for license

af50340

djcobble commited on Apr 5, 2024

Updating preprocessor config to LlavaProcessor.py

22045a9

Musashi Hinck commited on Apr 4, 2024

Adding link to arxiv

f8bf4da
verified

musashihinck commited on Apr 3, 2024

Fixing broken link

23b53ec

Musashi Hinck commited on Mar 26, 2024

Adding usage and preprocessing script

3eaf39c

Musashi Hinck commited on Mar 26, 2024

Converting weights

6e532d5
verified

musashihinck commited on Mar 25, 2024

Converting weights

7de365f
verified

musashihinck commited on Mar 25, 2024

adding state_dict

a2ed3a6

shaoyent commited on Mar 22, 2024

Initial model card

cb4912f

Musashi Hinck commited on Mar 22, 2024

adding model weights

5b48a72

shaoyent commited on Mar 21, 2024

Update README.md

d347dfc
verified

matthewlyleolson commited on Mar 14, 2024

initial commit

8358a72
verified

shaoyent commited on Mar 14, 2024

Commit History

Update README.md dca66d9 verified

removed warnings/deprecation errors d127740 verified

Update README.md 73bc006 verified

Update README.md b5674bf verified

Update LICENSE.md d51c5d1 verified

Update README.md 8063bbb verified

Update LICENSE.md a3f0eca verified

Upload LICENSE.md c6964a4 verified

Update README.md 5cde804 verified

Update README.md 50a4edc verified

Update README.md 295ac35 verified

Update README.md 8ec8773 verified

fix bug in conversation 53252a4 verified

Remove device args in example code 4b76d8b verified

Fix links in markdown & additional metadata for license af50340

Updating preprocessor config to LlavaProcessor.py 22045a9

Adding link to arxiv f8bf4da verified

Fixing broken link 23b53ec

Adding usage and preprocessing script 3eaf39c

Converting weights 6e532d5 verified

Converting weights 7de365f verified

adding state_dict a2ed3a6

Initial model card cb4912f

adding model weights 5b48a72

Update README.md d347dfc verified

initial commit 8358a72 verified

Update README.md

dca66d9
verified

removed warnings/deprecation errors

d127740
verified

Update README.md

73bc006
verified

Update README.md

b5674bf
verified

Update LICENSE.md

d51c5d1
verified

Update README.md

8063bbb
verified

Update LICENSE.md

a3f0eca
verified

Upload LICENSE.md

c6964a4
verified

Update README.md

5cde804
verified

Update README.md

50a4edc
verified

Update README.md

295ac35
verified

Update README.md

8ec8773
verified

fix bug in conversation

53252a4
verified

Remove device args in example code

4b76d8b
verified

Fix links in markdown & additional metadata for license

af50340

Updating preprocessor config to LlavaProcessor.py

22045a9

Adding link to arxiv

f8bf4da
verified

Fixing broken link

23b53ec

Adding usage and preprocessing script

3eaf39c

Converting weights

6e532d5
verified

Converting weights

7de365f
verified

adding state_dict

a2ed3a6

Initial model card

cb4912f

adding model weights

5b48a72

Update README.md

d347dfc
verified

initial commit

8358a72
verified