Generates incoherent outputs for me with VLLM 0.18

#5
by catplusplus - opened

Like Chinese characters and not answering my actual question. Any KIs/workarounds?

This is on NVIDIA Thor dev kit

Sign up or log in to comment