Ali-Yaser committed on
Commit
e042ddd
·
verified ·
1 Parent(s): 06cd72a

Update README.md

Files changed (1)
  1. README.md +1 -7
README.md CHANGED
@@ -55,17 +55,11 @@ and lets download the model and run model
  # Load and run the model:
  vllm serve "Ali-Yaser/Qwen3-R1-8B"
  ```
-
- For deployment, you can use `sglang>=0.4.6.post1` or `vllm>=0.8.5` or to create an OpenAI-compatible API endpoint:
- - SGLang:
- ```shell
- python -m sglang.launch_server --model-path Qwen/Qwen3-8B --reasoning-parser qwen3
- ```
+ #
  - vLLM:
  ```shell
  vllm serve Qwen/Qwen3-8B --enable-reasoning --reasoning-parser deepseek_r1
  ```
-
  For local use, applications such as Ollama, LMStudio, MLX-LM, llama.cpp, and KTransformers have also supported Qwen3.
 
  ## Switching Between Thinking and Non-Thinking Mode