not sure if is only me, but i can't get around this error
never mind, issue on my side with cudaother than that the vllm version is much more fast than the transformers version
· Sign up or log in to comment