Error Loading using vLLM

#1
by suleimanelkhoury - opened

Hi, running the model using vLLM returns the following error:
AttributeError: CachedMistralCommonBackend has no attribute is_fast

vLLM is also detecting the architecture falsely: Resolved architecture: TransformersMultiModalForCausalLM

as if vLLM is programmed to only detect the source mistralai/Voxtral-Mini-4B-Realtime-2602 repository. Do you encounter the same issue?

Sign up or log in to comment