Ali-Yaser committed
Commit 32e20c2 · verified
1 Parent(s): e042ddd

Update README.md

Files changed (1)
  1. README.md +17 -7
README.md CHANGED
@@ -52,14 +52,24 @@ pip install vllm
 
  and lets download the model and run model
  ```python
+
  # Load and run the model:
  vllm serve "Ali-Yaser/Qwen3-R1-8B"
  ```
- #
- - vLLM:
- ```shell
- vllm serve Qwen/Qwen3-8B --enable-reasoning --reasoning-parser deepseek_r1
- ```
- For local use, applications such as Ollama, LMStudio, MLX-LM, llama.cpp, and KTransformers have also supported Qwen3.
 
- ## Switching Between Thinking and Non-Thinking Mode
+ and Run it this is example
+ #
+ ```
+ # Call the server using curl:
+ curl -X POST "http://localhost:8000/v1/chat/completions" \
+ -H "Content-Type: application/json" \
+ --data '{
+ "model": "Ali-Yaser/Qwen3-R1-8B",
+ "messages": [
+ {
+ "role": "user",
+ "content": "1+434334434+10x22=?"
+ }
+ ]
+ }'
+ ```
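
Note: the curl call added in this commit could also be issued from Python. The following is a minimal sketch, not part of the commit. It assumes the server started by `vllm serve` is listening on `localhost:8000` (as in the curl example) and returns an OpenAI-style chat completion body. The helper names `build_chat_request` and `ask` are hypothetical and exist only for illustration.

```python
import json
import urllib.request

# Assumptions: URL and model name mirror the curl example in the diff.
BASE_URL = "http://localhost:8000/v1/chat/completions"
MODEL = "Ali-Yaser/Qwen3-R1-8B"


def build_chat_request(prompt, model=MODEL, url=BASE_URL):
    """Build the same POST request as the curl example, without sending it."""
    payload = {
        "model": model,
        "messages": [{"role": "user", "content": prompt}],
    }
    return urllib.request.Request(
        url,
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def ask(prompt, timeout=120):
    """Send the request and return the assistant's reply text.

    Assumes an OpenAI-style response body; if the server is started with a
    reasoning parser, the reply layout may differ.
    """
    request = build_chat_request(prompt)
    with urllib.request.urlopen(request, timeout=timeout) as response:
        body = json.load(response)
    return body["choices"][0]["message"]["content"]
```

With the server running, `print(ask("1+434334434+10x22=?"))` sends the same prompt as the curl example. Keeping request construction separate from sending makes the payload easy to inspect without a live server.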