testing on vLLM

#1
by netwii - opened

it works with sglang, but for vllm I'm getting the following when loading the weights :
(Worker_PP0 pid=242) ERROR 04-19 15:36:30 [multiproc_executor.py:879] param = params_dict[name]
(Worker_PP0 pid=242) ERROR 04-19 15:36:30 [multiproc_executor.py:879] ~~~~~~~~~~~^^^^^^
(Worker_PP0 pid=242) ERROR 04-19 15:36:30 [multiproc_executor.py:879] KeyError: 'layers.0.self_attn.qkv_proj.k_bias'

Sign up or log in to comment