Ali-Yaser/Qwen3-R1-8B

Ali-Yaser committed about 10 hours ago

Commit e042ddd · verified · 1 Parent(s): 06cd72a

Update README.md

Files changed (1)

README.md +1 -7

@@ -55,17 +55,11 @@ and lets download the model and run model
 # Load and run the model:
 vllm serve "Ali-Yaser/Qwen3-R1-8B"
 ```
-For deployment, you can use `sglang>=0.4.6.post1` or `vllm>=0.8.5` or to create an OpenAI-compatible API endpoint:
-- SGLang:
-    ```shell
-    python -m sglang.launch_server --model-path Qwen/Qwen3-8B --reasoning-parser qwen3
-    ```
+#
 - vLLM:
     ```shell
     vllm serve Qwen/Qwen3-8B --enable-reasoning --reasoning-parser deepseek_r1
     ```
 For local use, applications such as Ollama, LMStudio, MLX-LM, llama.cpp, and KTransformers have also supported Qwen3.
 ## Switching Between Thinking and Non-Thinking Mode