eerrr9 commited on
Commit
45fa889
·
verified ·
1 Parent(s): e4495f7

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +30 -0
README.md CHANGED
@@ -40,6 +40,36 @@ These metrics demonstrate robust acceleration performance across diverse and com
40
  ![2](https://hackmd.io/_uploads/S1Da5BLmbl.png)
41
  ![3](https://hackmd.io/_uploads/S1v65HIm-e.png)
42
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
43
 
44
  ## Training Data
45
 
 
40
  ![2](https://hackmd.io/_uploads/S1Da5BLmbl.png)
41
  ![3](https://hackmd.io/_uploads/S1v65HIm-e.png)
42
 
43
+ ## Quick Start
44
+
45
+ ### Requirements
46
+
47
+ - NVIDIA GPU
48
+ - CUDA 12.0+
49
+ - PyTorch 2.0+
50
+
51
+ ### Installation
52
+
53
+ ```bash
54
+ pip install sglang==0.5.6
55
+ ```
56
+
57
+ ### Inference with SGLang
58
+
59
+ ```python
60
+ python3 -m sglang.launch_server \
61
+ --model-path /models/Kimi-K2-Instruct \
62
+ --host 0.0.0.0 --port 30012 \
63
+ --trust-remote-code \
64
+ --attention-backend fa3 \
65
+ --mem-fraction-static 0.9 \
66
+ --tp-size 1 \
67
+ --speculative-algorithm EAGLE3 \
68
+ --speculative-draft-model-path AQ-MedAI/Kimi-K2-Instruct-eagle3 \
69
+ --speculative-num-steps 3 \
70
+ --speculative-eagle-topk 1 \
71
+ --speculative-num-draft-tokens 4
72
+ ```
73
 
74
  ## Training Data
75