Instructions to use dphn/dolphin-vision-7b with libraries, inference providers, notebooks, and local apps. Follow these links to get started.

Libraries

How to use dphn/dolphin-vision-7b with Transformers:

# Use a pipeline as a high-level helper
from transformers import pipeline

pipe = pipeline("text-generation", model="dphn/dolphin-vision-7b", trust_remote_code=True)
messages = [
    {"role": "user", "content": "Who are you?"},
]
pipe(messages)

# Load model directly
from transformers import AutoModelForCausalLM
model = AutoModelForCausalLM.from_pretrained("dphn/dolphin-vision-7b", trust_remote_code=True, dtype="auto")

Notebooks
Google Colab
Kaggle
Local Apps Settings

vLLM

How to use dphn/dolphin-vision-7b with vLLM:

Install from pip and serve model

# Install vLLM from pip:
pip install vllm
# Start the vLLM server:
vllm serve "dphn/dolphin-vision-7b"
# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:8000/v1/chat/completions" \
	-H "Content-Type: application/json" \
	--data '{
		"model": "dphn/dolphin-vision-7b",
		"messages": [
			{
				"role": "user",
				"content": "What is the capital of France?"
			}
		]
	}'

Use Docker

docker model run hf.co/dphn/dolphin-vision-7b

SGLang

How to use dphn/dolphin-vision-7b with SGLang:

Install from pip and serve model

# Install SGLang from pip:
pip install sglang
# Start the SGLang server:
python3 -m sglang.launch_server \
    --model-path "dphn/dolphin-vision-7b" \
    --host 0.0.0.0 \
    --port 30000
# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:30000/v1/chat/completions" \
	-H "Content-Type: application/json" \
	--data '{
		"model": "dphn/dolphin-vision-7b",
		"messages": [
			{
				"role": "user",
				"content": "What is the capital of France?"
			}
		]
	}'

Use Docker images

docker run --gpus all \
    --shm-size 32g \
    -p 30000:30000 \
    -v ~/.cache/huggingface:/root/.cache/huggingface \
    --env "HF_TOKEN=<secret>" \
    --ipc=host \
    lmsysorg/sglang:latest \
    python3 -m sglang.launch_server \
        --model-path "dphn/dolphin-vision-7b" \
        --host 0.0.0.0 \
        --port 30000
# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:30000/v1/chat/completions" \
	-H "Content-Type: application/json" \
	--data '{
		"model": "dphn/dolphin-vision-7b",
		"messages": [
			{
				"role": "user",
				"content": "What is the capital of France?"
			}
		]
	}'

Docker Model Runner
How to use dphn/dolphin-vision-7b with Docker Model Runner:
```
docker model run hf.co/dphn/dolphin-vision-7b
```

The checkpoint you are trying to load has model type `bunny-qwen` but Transformers does not recognize this architecture.

by catworld1212 - opened Jul 20, 2024

Discussion

catworld1212

Jul 20, 2024

I am getting this error when I try to run this model.

The checkpoint you are trying to load has model type bunny-qwen but Transformers does not recognize this
architecture. This could be because of an issue with the checkpoint, or because your version of Transformers is out of date.

catworld1212

Jul 20, 2024

Any fix @ehartford

qnguyen3

Jul 20, 2024

@catworld1212 you need to pass trust_remote_code = True into AutoModelForCausalLM.from_pretrained

catworld1212

Jul 20, 2024

@qnguyen3 I did still same error please check my code:

    import torch
        import transformers
        from transformers import AutoModelForCausalLM, AutoTokenizer
        from PIL import Image
        import warnings
        import requests
        # disable some warnings
        transformers.logging.set_verbosity_error()
        transformers.logging.disable_progress_bar()
        warnings.filterwarnings('ignore')

        # set device
        torch.set_default_device('cuda')  # or 'cpu'

        model_name = 'cognitivecomputations/dolphin-vision-7b'

        # create model
        model = AutoModelForCausalLM.from_pretrained(
            model_name,
            torch_dtype=torch.float16,
            device_map='auto',
            trust_remote_code=True)
        tokenizer = AutoTokenizer.from_pretrained(
            model_name,
            trust_remote_code=True)

        # text prompt
        prompt = 'Describe this image in detail'

        messages = [
            {"role": "user", "content": f'<image>\n{prompt}'}
        ]
        text = tokenizer.apply_chat_template(
            messages,
            tokenize=False,
            add_generation_prompt=True
        )

        print(text)

        text_chunks = [tokenizer(chunk).input_ids for chunk in text.split('<image>')]
        input_ids = torch.tensor(text_chunks[0] + [-200] + text_chunks[1], dtype=torch.long).unsqueeze(0)

        # image, sample images can be found in images folder
        image = Image.open(requests.get(
            "https://uploads4.wikiart.org/images/paul-klee/death-for-the-idea-1915.jpg!Large.jpg", stream=True).raw)
        image_tensor = model.process_images([image], model.config).to(dtype=model.dtype)

        # generate
        output_ids = model.generate(
            input_ids,
            images=image_tensor,
            max_new_tokens=2048,
            use_cache=True)[0]

        print(tokenizer.decode(output_ids[input_ids.shape[1]:], skip_special_tokens=True).strip())

nicolollo

Jul 20, 2024

catworld1212

Jul 21, 2024

@qnguyen3 Any fix?

FundropInteractive

Jul 21, 2024

This comment has been hidden

catworld1212

Jul 23, 2024

hi @qnguyen3 can you pls test the model?

qnguyen3

Jul 23, 2024

yes i will try to test it tonight

qnguyen3

Jul 23, 2024

@catworld1212 should be fixed, could you give it a check?

catworld1212

Jul 24, 2024

•

edited Jul 24, 2024

@qnguyen3 it works

saishf

Jul 24, 2024

Works for me too :3

The image portrays a contemporary living room bathed in natural light from a large window on the left. Dominating the center of the room is a beige sofa, comfortably flanked by a blue armchair and a wooden side table, each adorned with a plant, adding a touch of greenery to the space.

Above the sofa, four framed pictures of insects are neatly arranged on the white wall, creating a striking contrast against the plain background. To the right of the sofa stands a wooden bookshelf, its top decorated with an array of pottery and other objects, adding a personal touch to the room.

A wooden floor stretches out across the room, leading to a black and white striped rug that lies on the wood, adding a patterned element to the otherwise neutral palette. The room is a harmonious blend of modern design and personal style, with each piece of furniture and decor contributing to the overall aesthetic.