YummyYum commited on
Commit
42cc0b9
·
verified ·
1 Parent(s): c8b90cd

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +29 -7
README.md CHANGED
@@ -13,22 +13,22 @@ license: apache-2.0
13
  ## Benchmark Result
14
  |Metrics|MiniMax-M2.7-Nvidia-Origin|MiniMax-M2.7-Nvidia-FlagOS|
15
  |-------|---------------|---------------|
16
- | | | |
17
- | | | |
18
 
19
  # User Guide
20
  Environment Setup
21
 
22
  | Item | Version |
23
  |------------------|----------------------|
24
- | Docker Version | |
25
- | Operating System | |
26
 
27
  ## Operation Steps
28
 
29
  ### Download FlagOS Image
30
  ```bash
31
-
32
  ```
33
 
34
  ### Download Open-source Model Weights
@@ -39,16 +39,38 @@ modelscope download --model FlagRelease/MiniMax-M2.7-nvidia-FlagOS --local_dir /
39
 
40
  ### Start the Container
41
  ```bash
42
-
 
 
 
 
 
 
43
  ```
44
  ### Start the Server
45
  ```bash
46
-
 
 
 
 
 
 
 
 
 
 
47
  ```
48
 
49
  ## Service Invocation
50
  ### Invocation Script
51
  ```bash
 
 
 
 
 
 
52
 
53
  ```
54
 
 
13
  ## Benchmark Result
14
  |Metrics|MiniMax-M2.7-Nvidia-Origin|MiniMax-M2.7-Nvidia-FlagOS|
15
  |-------|---------------|---------------|
16
+ |GPQA_Diamond |0.7071 |0.6313|
17
+ |Aime24 | 0.9 | 0.9|
18
 
19
  # User Guide
20
  Environment Setup
21
 
22
  | Item | Version |
23
  |------------------|----------------------|
24
+ | Docker Version | 24.0.0 |
25
+ | Operating System | Ubuntu 22.04.4 |
26
 
27
  ## Operation Steps
28
 
29
  ### Download FlagOS Image
30
  ```bash
31
+ docker pull harbor.baai.ac.cn/flagrelease-public/flagrelease-nvidia-minimax:202604120915
32
  ```
33
 
34
  ### Download Open-source Model Weights
 
39
 
40
  ### Start the Container
41
  ```bash
42
+ #Container Startup
43
+ docker run --init --detach --net=host --user 0 --ipc=host \
44
+ -v /data:/data --security-opt=seccomp=unconfined \
45
+ --privileged --ulimit=stack=67108864 --ulimit=memlock=-1 \
46
+ --shm-size=512G --gpus all \
47
+ --name flagos harbor.baai.ac.cn/flagrelease-public/flagrelease-nvidia-minimax:202604120915 sleep infinity
48
+ docker exec -it flagos bash
49
  ```
50
  ### Start the Server
51
  ```bash
52
+ vllm serve /data/MiniMax-M2.7 \
53
+ --served-model-name minimax_m2.7 \
54
+ --tensor-parallel-size 8
55
+ --gpu-memory-utilization 0.6 \
56
+ --tool-call-parser minimax_m2 \
57
+ --reasoning-parser minimax_m2_append_think \
58
+ --host 0.0.0.0 \
59
+ --port 8000 \
60
+ --load-format fastsafetensors \
61
+ --enable-auto-tool-choice \
62
+ --trust-remote-code
63
  ```
64
 
65
  ## Service Invocation
66
  ### Invocation Script
67
  ```bash
68
+ curl http://localhost:8000/v1/chat/completions \
69
+ -H "Content-Type: application/json" \
70
+ -d '{
71
+ "model": "minimax_m2.7",
72
+ "messages": [{"role": "user", "content": "你好"}]
73
+ }'
74
 
75
  ```
76