Instructions to use sbottazzi/LLaVA-Med_weights with libraries, inference providers, notebooks, and local apps. Follow these links to get started.

Libraries

How to use sbottazzi/LLaVA-Med_weights with Transformers:

# Use a pipeline as a high-level helper
from transformers import pipeline

pipe = pipeline("text-generation", model="sbottazzi/LLaVA-Med_weights")

# Load model directly
from transformers import AutoProcessor, AutoModelForCausalLM

processor = AutoProcessor.from_pretrained("sbottazzi/LLaVA-Med_weights")
model = AutoModelForCausalLM.from_pretrained("sbottazzi/LLaVA-Med_weights")

Notebooks
Google Colab
Kaggle
Local Apps Settings

vLLM

How to use sbottazzi/LLaVA-Med_weights with vLLM:

Install from pip and serve model

# Install vLLM from pip:
pip install vllm
# Start the vLLM server:
vllm serve "sbottazzi/LLaVA-Med_weights"
# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:8000/v1/completions" \
	-H "Content-Type: application/json" \
	--data '{
		"model": "sbottazzi/LLaVA-Med_weights",
		"prompt": "Once upon a time,",
		"max_tokens": 512,
		"temperature": 0.5
	}'

Use Docker

docker model run hf.co/sbottazzi/LLaVA-Med_weights

SGLang

How to use sbottazzi/LLaVA-Med_weights with SGLang:

Install from pip and serve model

# Install SGLang from pip:
pip install sglang
# Start the SGLang server:
python3 -m sglang.launch_server \
    --model-path "sbottazzi/LLaVA-Med_weights" \
    --host 0.0.0.0 \
    --port 30000
# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:30000/v1/completions" \
	-H "Content-Type: application/json" \
	--data '{
		"model": "sbottazzi/LLaVA-Med_weights",
		"prompt": "Once upon a time,",
		"max_tokens": 512,
		"temperature": 0.5
	}'

Use Docker images

docker run --gpus all \
    --shm-size 32g \
    -p 30000:30000 \
    -v ~/.cache/huggingface:/root/.cache/huggingface \
    --env "HF_TOKEN=<secret>" \
    --ipc=host \
    lmsysorg/sglang:latest \
    python3 -m sglang.launch_server \
        --model-path "sbottazzi/LLaVA-Med_weights" \
        --host 0.0.0.0 \
        --port 30000
# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:30000/v1/completions" \
	-H "Content-Type: application/json" \
	--data '{
		"model": "sbottazzi/LLaVA-Med_weights",
		"prompt": "Once upon a time,",
		"max_tokens": 512,
		"temperature": 0.5
	}'

Docker Model Runner
How to use sbottazzi/LLaVA-Med_weights with Docker Model Runner:
```
docker model run hf.co/sbottazzi/LLaVA-Med_weights
```

Is this the final llava-med weight?

by YiHongping - opened Jan 23, 2024

Discussion

YiHongping

Jan 23, 2024

My merged model keeps reporting errors

sbottazzi

Owner Jan 23, 2024

Hello,
I downloaded the Llama weights from this link https://huggingface.co/nyanko7/LLaMA-7B/tree/main and followed the instructions in the LlaVA-Med delta repo (https://huggingface.co/microsoft/llava-med-7b-delta) but couldn't make it work yet because of GPU issues, so I don't know yet if they work.

YiHongping

Jan 24, 2024

Thank you for your reply！I used your weights andit worked, but I used the weights of llava-med to merge llama-7b-hf and found that it didn't work. Which llama weight did you use?

sbottazzi

Owner Jan 24, 2024

Hello,
I downloaded the Llama weights from this link https://huggingface.co/nyanko7/LLaMA-7B/tree/main. To convert them into HF format, I followed these instructions https://huggingface.co/docs/transformers/main/model_doc/llama#usage-tips . And then keep on with the guide in the LlaVA-Med delta repo (https://huggingface.co/microsoft/llava-med-7b-delta sections "Install" and "Serving"). Also found this youtube tutorial, maybe it's useful for you https://www.youtube.com/watch?v=qfkKz8JYfcc&t=8s.
May I asked how did you make my weights work? Did you use the "Medical Visual Chat " or the "Medical VQA"?

YiHongping

Jan 24, 2024

hello,
I loaded model.worker with your weights, and then used gradient_web for inference, and then it worked. I used my own weights before (similar to the steps you mentioned) and get an error response.So your weights should be correct.

sbottazzi

Owner Feb 19, 2024

Hello YiHongping, I just got back to LlavaMed and wanted to ask you if you could share the script you used to make the model work or if you followed a tutorial. Thank you!

YiHongping

Feb 20, 2024

hello,
I can make inferences directly using your weights, but when I follow the repo tutorial, the inferences will be garbled.

sbottazzi

Owner Feb 20, 2024

This comment has been hidden

YiHongping

Feb 21, 2024

you need to change pydantic 1.10.13 and gradio 3.23.0,and other follows the repo

NithinM-29602

Feb 26, 2024

Hi, I have followed the steps mentioned in the documentation and got the LLaVa-Med weights. Now I need to upload the files and the checkpoints and then create a inference endpoints so that I can use this model with langchain framework to work on my project. Guide me through this

sbottazzi

Owner Feb 27, 2024

Hello, I'm sorry but I have not knowledge at all of langchain framework. I am also trying to run inference but couldn't manage to do it yet, I was trying to use the "eval" files and also the "serve" folder that are on the Github repo. What YiHongping told me about "model_worker" and "gradient_web" in a previous message I think it's reffered to the files in the "serve" folder in the repo.

NithinM-29602

Mar 10, 2024

@sbottazziunsam Thanks for the info, I have used your weights it working quite well.

sbottazzi

Owner Mar 11, 2024

@NithinM-29602 I'm glad they worked. May I ask you how did yo use them? Did you run inference? I'd appreciate if you could give me a detailed description of the stepts you took. Thank you

NithinM-29602

Mar 12, 2024

@NithinM-29602 I just followed this tutorial https://www.youtube.com/watch?v=qfkKz8JYfcc&t=8s. But instead of using the delta weight of mine, I used yours. Usually you need to specify the path of the final llava-med weights in the command and it worked.
Have a Great Day...

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment