Update README.md
Browse files
README.md
CHANGED
|
@@ -10,7 +10,7 @@ pipeline_tag: text-generation
|
|
| 10 |
|
| 11 |
To run, please ensure that your CPU supports the AMX instruction set (Intel Xeon processor, Sapphire Rapids or newer), and make note of your NUMA node count. Install `kt-kernal` and `sglang-kt` following the [official documentation](https://github.com/kvcache-ai/ktransformers/blob/main/kt-kernel/README.md).
|
| 12 |
|
| 13 |
-
Then, download
|
| 14 |
|
| 15 |
```
|
| 16 |
PYTORCH_ALLOC_CONF=expandable_segments:True \
|
|
|
|
| 10 |
|
| 11 |
To run, please ensure that your CPU supports the AMX instruction set (Intel Xeon processor, Sapphire Rapids or newer), and make note of your NUMA node count. Install `kt-kernal` and `sglang-kt` following the [official documentation](https://github.com/kvcache-ai/ktransformers/blob/main/kt-kernel/README.md).
|
| 12 |
|
| 13 |
+
Then, download the official weights of MiniMaxAI/MiniMax-M2.5 in [FP8](https://huggingface.co/MiniMaxAI/MiniMax-M2.5), as well as this CPU-optimized quantized model, and prepare your launch command:
|
| 14 |
|
| 15 |
```
|
| 16 |
PYTORCH_ALLOC_CONF=expandable_segments:True \
|