Safetensors
qwen2
JitaiHao commited on
Commit
779d7fa
·
verified ·
1 Parent(s): 452860c

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +9 -0
README.md CHANGED
@@ -18,6 +18,15 @@ LRC-1.7B-Base is a base pre-trained model. While it has not undergone specific S
18
 
19
  ## How to Get Started with the Model
20
 
 
 
 
 
 
 
 
 
 
21
  ```python
22
  from transformers import AutoTokenizer, AutoModelForCausalLM
23
 
 
18
 
19
  ## How to Get Started with the Model
20
 
21
+ **❗ Critical:** For vLLM serving, please specify `model-impl==transformers` when using qwen series model. This is because, in the current implementation of vLLM, the qwen model does not support setting a custom `head_dim` through the config. Fortunately, vLLM allows using transformers as the backend.
22
+
23
+ Tested versions that can serve properly: `vllm==0.8.5.post1` and `transformers==4.51.3`.
24
+
25
+ Serve command:
26
+ ```commandline
27
+ vllm serve JitaiHao/LRC-1.7B-Base --model-impl transformers
28
+ ```
29
+
30
  ```python
31
  from transformers import AutoTokenizer, AutoModelForCausalLM
32