How to use this model?

by bingw5 - opened Apr 27, 2024

I can see there is example code to run the model, but that's for the original model. Do I need to modify any parameters or lines to run the quantized model?

Owner Apr 27, 2024

Ah, that example code is from the original model card.
The example code for running this model is very similar, but it points to this repo. See here: https://huggingface.co/failspy/InternVL-Chat-V1-5-8bit/blob/main/example_inference.py

Running that script runs the quantized model.
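
For a rough idea of what "pointed to this repo" means, here is a minimal sketch. It is not the contents of example_inference.py, which remains the authoritative version. It assumes the usual transformers loading pattern with trust_remote_code=True, and it assumes the quantization settings are read from the repo's own config, so no extra quantization arguments are passed. The helper name load_quantized is made up for illustration.

```python
# Sketch only: example_inference.py in the repo is the authoritative script.
QUANTIZED_REPO = "failspy/InternVL-Chat-V1-5-8bit"  # repo named in this thread


def load_quantized(repo_id: str = QUANTIZED_REPO):
    """Hypothetical helper: load model and tokenizer from the quantized repo."""
    # Imported lazily so this file can be imported without transformers installed.
    from transformers import AutoModel, AutoTokenizer

    # trust_remote_code=True lets transformers run the custom modeling code
    # shipped in the repo (the model type is not built into the library).
    tokenizer = AutoTokenizer.from_pretrained(repo_id, trust_remote_code=True)

    # Assumption: quantization settings come from the repo's config, so the
    # only change from the original model card is the repo id.
    model = AutoModel.from_pretrained(repo_id, trust_remote_code=True).eval()
    return model, tokenizer
```

Calling load_quantized() downloads the weights and returns (model, tokenizer). Any extra dependencies the quantized weights need are whatever example_inference.py and the repo's requirements specify.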

I ran into an error:

TypeError("internvl_chat isn't supported yet.")

Do you know what the root cause is?
