Ali-Yaser
/

Qwen3-R1-8B

Text Generation

text-generation-inference

Model card Files Files and versions

Ali-Yaser commited on about 7 hours ago

Commit

32e20c2

·

verified ·

1 Parent(s): e042ddd

Update README.md

Files changed (1) hide show

README.md +17 -7

README.md CHANGED Viewed

@@ -52,14 +52,24 @@ pip install vllm
 and lets download the model and run model
 ```python
 # Load and run the model:
 vllm serve "Ali-Yaser/Qwen3-R1-8B"
 ```
-#
-- vLLM:
-    ```shell
-    vllm serve Qwen/Qwen3-8B --enable-reasoning --reasoning-parser deepseek_r1
-    ```
-For local use, applications such as Ollama, LMStudio, MLX-LM, llama.cpp, and KTransformers have also supported Qwen3.
-## Switching Between Thinking and Non-Thinking Mode

 and lets download the model and run model
 ```python
 # Load and run the model:
 vllm serve "Ali-Yaser/Qwen3-R1-8B"
 ```
+and Run it this is example
+#
+```
+# Call the server using curl:
+curl -X POST "http://localhost:8000/v1/chat/completions" \
+	-H "Content-Type: application/json" \
+	--data '{
+		"model": "Ali-Yaser/Qwen3-R1-8B",
+		"messages": [
+			{
+				"role": "user",
+				"content": "1+434334434+10x22=?"
+			}
+		]
+	}'
+```