lujangusface commited on
Commit
2fcf070
·
verified ·
1 Parent(s): c5e920c

fix: correct pip install command and --dtype fp8 -> --quantization fp8

Browse files
Files changed (1) hide show
  1. README.md +3 -3
README.md CHANGED
@@ -31,7 +31,7 @@ Requires our [SGLang fork](https://github.com/tails-mpt/sglang) for MiniMax-M2.5
31
  **B=1 server** (wide tree — optimal for single-user, real-time requests):
32
 
33
  ```bash
34
- pip install git+https://github.com/tails-mpt/sglang.git
35
 
36
  python -m sglang.launch_server \
37
  --model-path MiniMaxAI/MiniMax-M2.5 \
@@ -40,7 +40,7 @@ python -m sglang.launch_server \
40
  --speculative-num-steps 3 \
41
  --speculative-num-draft-tokens 8 \
42
  --speculative-eagle-topk 4 \
43
- --dtype fp8 \
44
  --tp 4 \
45
  --port 30000
46
  ```
@@ -55,7 +55,7 @@ python -m sglang.launch_server \
55
  --speculative-num-steps 5 \
56
  --speculative-num-draft-tokens 6 \
57
  --speculative-eagle-topk 1 \
58
- --dtype fp8 \
59
  --tp 4 \
60
  --port 30002
61
  ```
 
31
  **B=1 server** (wide tree — optimal for single-user, real-time requests):
32
 
33
  ```bash
34
+ pip install 'git+https://github.com/tails-mpt/sglang.git#subdirectory=python'
35
 
36
  python -m sglang.launch_server \
37
  --model-path MiniMaxAI/MiniMax-M2.5 \
 
40
  --speculative-num-steps 3 \
41
  --speculative-num-draft-tokens 8 \
42
  --speculative-eagle-topk 4 \
43
+ --quantization fp8 \
44
  --tp 4 \
45
  --port 30000
46
  ```
 
55
  --speculative-num-steps 5 \
56
  --speculative-num-draft-tokens 6 \
57
  --speculative-eagle-topk 1 \
58
+ --quantization fp8 \
59
  --tp 4 \
60
  --port 30002
61
  ```