Getting the following error with VLLM: KeyError: 'ministral3'
vllm-server-1 | (APIServer pid=1) text_config = CONFIG_MAPPING[text_config["model_type"]]
vllm-server-1 | (APIServer pid=1) ~~~~~~~~~~~~~~^^^^^^^^^^^^^^^^^^^^^^^^^^^
vllm-server-1 | (APIServer pid=1) File "/usr/local/lib/python3.12/dist-packages/transformers/models/auto/configuration_auto.py", line 1049, in __getitem__
vllm-server-1 | (APIServer pid=1) raise KeyError(key)
vllm-server-1 | (APIServer pid=1) KeyError: 'ministral3'
Docker image: vllm/vllm-openai:latest
The Transformers version needed updating; with a newer build the model type resolves:
>>> import transformers
>>> from transformers.models.auto import CONFIG_MAPPING
>>> CONFIG_MAPPING['ministral3']
<class 'transformers.models.ministral3.configuration_ministral3.Ministral3Config'>
>>> transformers.__version__
'5.0.0.dev0'
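If you want to confirm up front whether the transformers build inside your container already knows the model type vLLM complains about, a minimal check (a sketch built around the same CONFIG_MAPPING lookup shown above) is:

# Check whether the installed transformers registers the 'ministral3' config.
from transformers.models.auto import CONFIG_MAPPING

model_type = "ministral3"  # the key from the KeyError in the traceback
try:
    print("supported:", CONFIG_MAPPING[model_type])
except KeyError:
    print("not supported; this transformers build needs updating")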
You also need to make sure you add these flags at the end:
vllm serve ... --tokenizer_mode mistral --config_format mistral --load_format mistral \
--enable-auto-tool-choice --tool-call-parser mistral
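Once the server is up, a quick sanity check against the OpenAI-compatible endpoint looks roughly like the sketch below; the base URL is a placeholder and depends on how your container exposes the API server, and the served model name is read back from the server rather than hard-coded:

from openai import OpenAI

# Placeholder base_url: adjust to wherever the vLLM API server is reachable.
client = OpenAI(base_url="http://localhost:8000/v1", api_key="EMPTY")

# The served model name is whatever was passed to `vllm serve ...`.
served_model = client.models.list().data[0].id

resp = client.chat.completions.create(
    model=served_model,
    messages=[{"role": "user", "content": "Reply with a short greeting."}],
    temperature=0.0,
)
print(resp.choices[0].message.content)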
I just encountered a similar issue when trying to SFT Ministral with HF Transformers and the Trainer.
If this config can solve the problem, I guess either Mistral or HF should update their libraries.
Hi, I met the same issue when loading an SFT-ed Ministral model. I had to remove --load_format mistral from the command, but then the inference results from vLLM are rather different from running inference directly with model.generate. Do you know what might be the reason? Thanks in advance!
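For what it's worth, a difference like that is easier to diagnose if both runs use the same prompt formatting and greedy decoding, so that sampling noise cannot explain it. A rough sketch of such a comparison, where the checkpoint path and server URL are placeholders and the server is assumed to have been started with that same checkpoint as its model name:

import torch
from transformers import AutoModelForCausalLM, AutoTokenizer
from openai import OpenAI

checkpoint = "path/to/my-sft-ministral"  # placeholder for the fine-tuned checkpoint
messages = [{"role": "user", "content": "Summarize the issue in one sentence."}]

# HF side: greedy decoding with the model's own chat template.
tok = AutoTokenizer.from_pretrained(checkpoint)
model = AutoModelForCausalLM.from_pretrained(checkpoint, torch_dtype=torch.bfloat16, device_map="auto")
inputs = tok.apply_chat_template(messages, add_generation_prompt=True, return_tensors="pt").to(model.device)
out = model.generate(inputs, max_new_tokens=128, do_sample=False)
print(tok.decode(out[0, inputs.shape[-1]:], skip_special_tokens=True))

# vLLM side: same messages, temperature 0, against the OpenAI-compatible server.
client = OpenAI(base_url="http://localhost:8000/v1", api_key="EMPTY")
resp = client.chat.completions.create(model=checkpoint, messages=messages, temperature=0.0, max_tokens=128)
print(resp.choices[0].message.content)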