JunHowie commited on
Commit
e131f54
·
verified ·
1 Parent(s): a2c03ee

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +1 -1
README.md CHANGED
@@ -21,7 +21,7 @@ Base model: [Qwen/Qwen3-8B](https://huggingface.co/Qwen/Qwen3-8B)
21
  <i>Compared to earlier quantized versions, the new quantized model demonstrates better tokens/s efficiency. This improvement comes from setting desc_act=False in the quantization configuration.</i>
22
 
23
  ```
24
- vllm serve JunHowie/Qwen3-8B-GPTQ-Int8
25
  ```
26
 
27
  ### 【Dependencies】
 
21
  <i>Compared to earlier quantized versions, the new quantized model demonstrates better tokens/s efficiency. This improvement comes from setting desc_act=False in the quantization configuration.</i>
22
 
23
  ```
24
+ vllm serve JunHowie/Qwen3-8B-GPTQ-Int8
25
  ```
26
 
27
  ### 【Dependencies】