Update README.md
Browse files
README.md
CHANGED
|
@@ -18,6 +18,15 @@ LRC-1.7B-Base is a base pre-trained model. While it has not undergone specific S
|
|
| 18 |
|
| 19 |
## How to Get Started with the Model
|
| 20 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 21 |
```python
|
| 22 |
from transformers import AutoTokenizer, AutoModelForCausalLM
|
| 23 |
|
|
|
|
| 18 |
|
| 19 |
## How to Get Started with the Model
|
| 20 |
|
| 21 |
+
**❗ Critical:** For vLLM serving, please specify `model-impl==transformers` when using qwen series model. This is because, in the current implementation of vLLM, the qwen model does not support setting a custom `head_dim` through the config. Fortunately, vLLM allows using transformers as the backend.
|
| 22 |
+
|
| 23 |
+
Tested versions that can serve properly: `vllm==0.8.5.post1` and `transformers==4.51.3`.
|
| 24 |
+
|
| 25 |
+
Serve command:
|
| 26 |
+
```commandline
|
| 27 |
+
vllm serve JitaiHao/LRC-1.7B-Base --model-impl transformers
|
| 28 |
+
```
|
| 29 |
+
|
| 30 |
```python
|
| 31 |
from transformers import AutoTokenizer, AutoModelForCausalLM
|
| 32 |
|