I have a general misunderstanding which vllm is used for intel's autoround models here? Is it llm-scaler? is it vanilla vllm? Which version?Would it work with single b60?Thank you!
· Sign up or log in to comment