billxbf
/

Nano-Raccoon-Preview-1104

Model card Files Files and versions

billxbf commited on Nov 4, 2025

Commit

aa81b56

·

verified ·

1 Parent(s): ee6c76d

Update README.md

Files changed (1) hide show

README.md +10 -0

README.md CHANGED Viewed

@@ -17,9 +17,19 @@ This model is a light SFT version from Qwen/Qwen3-14B, aimed at stable generativ
 ## Serve with vllm
 ```
 vllm serve billxbf/Nano-Raccoon-Preview-1104 \
   --trust-remote-code \
   --host 0.0.0.0 \
   --port 8000

 ## Serve with vllm
+**Single GPU**
+```
+vllm serve billxbf/Nano-Raccoon-Preview-1104 \
+  --trust-remote-code \
+  --host 0.0.0.0 \
+  --port 8000
+```
+**Use Tensor Parallel on 8xGPU**
 ```
 vllm serve billxbf/Nano-Raccoon-Preview-1104 \
+  --tensor-parallel-size 8 \
   --trust-remote-code \
   --host 0.0.0.0 \
   --port 8000