Instructions to use royleibov/Qwen2-VL-7B-Instruct-ZipNN-Compressed with libraries, inference providers, notebooks, and local apps. Follow these links to get started.

Libraries

How to use royleibov/Qwen2-VL-7B-Instruct-ZipNN-Compressed with Transformers:

# Use a pipeline as a high-level helper
from transformers import pipeline

pipe = pipeline("image-text-to-text", model="royleibov/Qwen2-VL-7B-Instruct-ZipNN-Compressed")
messages = [
    {
        "role": "user",
        "content": [
            {"type": "image", "url": "https://huggingface.co/datasets/huggingface/documentation-images/resolve/main/p-blog/candy.JPG"},
            {"type": "text", "text": "What animal is on the candy?"}
        ]
    },
]
pipe(text=messages)

# Load model directly
from transformers import AutoProcessor, AutoModelForImageTextToText

processor = AutoProcessor.from_pretrained("royleibov/Qwen2-VL-7B-Instruct-ZipNN-Compressed")
model = AutoModelForImageTextToText.from_pretrained("royleibov/Qwen2-VL-7B-Instruct-ZipNN-Compressed")
messages = [
    {
        "role": "user",
        "content": [
            {"type": "image", "url": "https://huggingface.co/datasets/huggingface/documentation-images/resolve/main/p-blog/candy.JPG"},
            {"type": "text", "text": "What animal is on the candy?"}
        ]
    },
]
inputs = processor.apply_chat_template(
	messages,
	add_generation_prompt=True,
	tokenize=True,
	return_dict=True,
	return_tensors="pt",
).to(model.device)

outputs = model.generate(**inputs, max_new_tokens=40)
print(processor.decode(outputs[0][inputs["input_ids"].shape[-1]:]))

Notebooks
Google Colab
Kaggle
Local Apps Settings

vLLM

How to use royleibov/Qwen2-VL-7B-Instruct-ZipNN-Compressed with vLLM:

Install from pip and serve model

# Install vLLM from pip:
pip install vllm
# Start the vLLM server:
vllm serve "royleibov/Qwen2-VL-7B-Instruct-ZipNN-Compressed"
# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:8000/v1/chat/completions" \
	-H "Content-Type: application/json" \
	--data '{
		"model": "royleibov/Qwen2-VL-7B-Instruct-ZipNN-Compressed",
		"messages": [
			{
				"role": "user",
				"content": [
					{
						"type": "text",
						"text": "Describe this image in one sentence."
					},
					{
						"type": "image_url",
						"image_url": {
							"url": "https://cdn.britannica.com/61/93061-050-99147DCE/Statue-of-Liberty-Island-New-York-Bay.jpg"
						}
					}
				]
			}
		]
	}'

Use Docker

docker model run hf.co/royleibov/Qwen2-VL-7B-Instruct-ZipNN-Compressed

SGLang

How to use royleibov/Qwen2-VL-7B-Instruct-ZipNN-Compressed with SGLang:

Install from pip and serve model

# Install SGLang from pip:
pip install sglang
# Start the SGLang server:
python3 -m sglang.launch_server \
    --model-path "royleibov/Qwen2-VL-7B-Instruct-ZipNN-Compressed" \
    --host 0.0.0.0 \
    --port 30000
# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:30000/v1/chat/completions" \
	-H "Content-Type: application/json" \
	--data '{
		"model": "royleibov/Qwen2-VL-7B-Instruct-ZipNN-Compressed",
		"messages": [
			{
				"role": "user",
				"content": [
					{
						"type": "text",
						"text": "Describe this image in one sentence."
					},
					{
						"type": "image_url",
						"image_url": {
							"url": "https://cdn.britannica.com/61/93061-050-99147DCE/Statue-of-Liberty-Island-New-York-Bay.jpg"
						}
					}
				]
			}
		]
	}'

Use Docker images

docker run --gpus all \
    --shm-size 32g \
    -p 30000:30000 \
    -v ~/.cache/huggingface:/root/.cache/huggingface \
    --env "HF_TOKEN=<secret>" \
    --ipc=host \
    lmsysorg/sglang:latest \
    python3 -m sglang.launch_server \
        --model-path "royleibov/Qwen2-VL-7B-Instruct-ZipNN-Compressed" \
        --host 0.0.0.0 \
        --port 30000
# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:30000/v1/chat/completions" \
	-H "Content-Type: application/json" \
	--data '{
		"model": "royleibov/Qwen2-VL-7B-Instruct-ZipNN-Compressed",
		"messages": [
			{
				"role": "user",
				"content": [
					{
						"type": "text",
						"text": "Describe this image in one sentence."
					},
					{
						"type": "image_url",
						"image_url": {
							"url": "https://cdn.britannica.com/61/93061-050-99147DCE/Statue-of-Liberty-Island-New-York-Bay.jpg"
						}
					}
				]
			}
		]
	}'

Docker Model Runner
How to use royleibov/Qwen2-VL-7B-Instruct-ZipNN-Compressed with Docker Model Runner:
```
docker model run hf.co/royleibov/Qwen2-VL-7B-Instruct-ZipNN-Compressed
```

Qwen2-VL-7B-Instruct-ZipNN-Compressed

Commit History

Add ZipNN support

a571cde
verified

royleibov commited on Sep 15, 2024

Add .znn files

a99e639

royleibov commited on Sep 15, 2024

Update generation_config.json (#22)

3ca981c
verified

chenkq commited on Sep 6, 2024

add vcr results

cacb254
verified

shuai bai commited on Sep 3, 2024

Update README.md (#6)

d776c71
verified

chenkq

jklj077 commited on Sep 2, 2024

add correct pipeline tag (#4)

ccd09ac
verified

chenkq

RaushanTurganbay HF Staff commited on Aug 31, 2024

Update README.md (#3)

ed24a23
verified

chenkq

reach-vb commited on Aug 30, 2024

Update README.md

8f9fc0b
verified

JustinLin610 commited on Aug 29, 2024

Update README.md

6424504
verified

JustinLin610 commited on Aug 29, 2024

Update README.md

6010982
verified

shuai bai commited on Aug 29, 2024

Update README.md

b6241d7
verified

JustinLin610 commited on Aug 29, 2024

Update README.md

1399c6f
verified

shuai bai commited on Aug 29, 2024

Create LICENSE

95aefb3
verified

shuai bai commited on Aug 29, 2024

Create README.md

e1b32dd
verified

shuai bai commited on Aug 29, 2024

Initial commit

f330ecb

yangapku commited on Aug 28, 2024

initial commit

ba2e56f
verified

clonefy commited on Aug 28, 2024

Commit History

Add ZipNN support a571cde verified

Add .znn files a99e639

Update generation_config.json (#22) 3ca981c verified

add vcr results cacb254 verified

Update README.md (#6) d776c71 verified

add correct pipeline tag (#4) ccd09ac verified

Update README.md (#3) ed24a23 verified

Update README.md 8f9fc0b verified

Update README.md 6424504 verified

Update README.md 6010982 verified

Update README.md b6241d7 verified

Update README.md 1399c6f verified

Create LICENSE 95aefb3 verified

Create README.md e1b32dd verified

Initial commit f330ecb

initial commit ba2e56f verified

Add ZipNN support

a571cde
verified

Add .znn files

a99e639

Update generation_config.json (#22)

3ca981c
verified

add vcr results

cacb254
verified

Update README.md (#6)

d776c71
verified

add correct pipeline tag (#4)

ccd09ac
verified

Update README.md (#3)

ed24a23
verified

Update README.md

8f9fc0b
verified

Update README.md

6424504
verified

Update README.md

6010982
verified

Update README.md

b6241d7
verified

Update README.md

1399c6f
verified

Create LICENSE

95aefb3
verified

Create README.md

e1b32dd
verified

Initial commit

f330ecb

initial commit

ba2e56f
verified