| Clinical Knowledge | 92.83% | 85.66% | -7.17% | ⚠️ Moderate Drop |
---
## **Deployment with Python**

It is recommended to use a virtual environment (such as **venv**, **conda**, or **uv**) to avoid dependency conflicts.

We recommend installing SGLang in a fresh Python environment:

```shell
git clone -b v0.5.4.post1 https://github.com/sgl-project/sglang.git
cd sglang

# Install the Python packages
pip install --upgrade pip
pip install -e "python"
```
Run the following command to start the SGLang server. SGLang will automatically download and cache the MiniMax-M2 model from Hugging Face.

**4-GPU deployment command:**

```shell
python -m sglang.launch_server \
    --model-path MiniMaxAI/MiniMax-M2 \
    --tp-size 4 \
    --tool-call-parser minimax-m2 \
    --reasoning-parser minimax-append-think \
    --host 0.0.0.0 \
    --trust-remote-code \
    --port 8000 \
    --mem-fraction-static 0.85
```
**8-GPU deployment command:**

```shell
python -m sglang.launch_server \
    --model-path MiniMaxAI/MiniMax-M2 \
    --tp-size 8 \
    --ep-size 8 \
    --tool-call-parser minimax-m2 \
    --reasoning-parser minimax-append-think \
    --host 0.0.0.0 \
    --trust-remote-code \
    --port 8000 \
    --mem-fraction-static 0.85
```
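Loading the model weights can take several minutes, so it is worth waiting until the server actually responds before sending requests. A minimal polling sketch in Python, assuming the server exposes a `/health` endpoint on the host and port configured above (check your SGLang version if the path differs):

```python
import time
import urllib.error
import urllib.request

def wait_for_server(url="http://localhost:8000/health", timeout=600, interval=5):
    """Poll `url` until it answers, or give up after `timeout` seconds."""
    deadline = time.monotonic() + timeout
    while time.monotonic() < deadline:
        try:
            # Any successful response means the server is up and serving.
            with urllib.request.urlopen(url, timeout=5):
                return True
        except (urllib.error.URLError, OSError):
            time.sleep(interval)  # not up yet; retry after a short pause
    return False
```

For example, `wait_for_server()` returns `True` as soon as the launch command above finishes loading, and `False` if nothing answers within the timeout.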
## **Testing Deployment**

After startup, you can test the SGLang OpenAI-compatible API with the following command:

```shell
curl http://localhost:8000/v1/chat/completions \
  -H "Content-Type: application/json" \
  -d '{
    "model": "MiniMaxAI/MiniMax-M2",
    "messages": [
      {"role": "system", "content": [{"type": "text", "text": "You are a helpful assistant."}]},
      {"role": "user", "content": [{"type": "text", "text": "Who won the world series in 2020?"}]}
    ]
  }'
```
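The same request can also be issued from Python with only the standard library. A minimal sketch, assuming the server from the previous step is running on `localhost:8000` (the `build_request` helper is illustrative, not part of SGLang):

```python
import json
from urllib import request

# The same chat-completion payload as the curl example above.
payload = {
    "model": "MiniMaxAI/MiniMax-M2",
    "messages": [
        {"role": "system", "content": [{"type": "text", "text": "You are a helpful assistant."}]},
        {"role": "user", "content": [{"type": "text", "text": "Who won the world series in 2020?"}]},
    ],
}

def build_request(url="http://localhost:8000/v1/chat/completions"):
    # POST the JSON payload with the proper content type.
    return request.Request(
        url,
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
    )

# Uncomment to send the request (requires the server to be running):
# with request.urlopen(build_request()) as resp:
#     print(json.load(resp)["choices"][0]["message"]["content"])
```

Because the endpoint is OpenAI-compatible, any OpenAI client library pointed at this base URL should work the same way.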
## Benchmarks

Coming soon.