Update README.md
--- a/README.md
+++ b/README.md
@@ -5,9 +5,7 @@ base_model:
 
 modelopt NVFP4 quantized MiniMax-M2.1
 
-
-
-If you see "No available shared memory broadcast block found in 60 seconds.", be patient.
+Works fine on 2x and 4x RTX 6000 Pro Blackwell via vLLM.
 
 Sample docker run (you will want to change this so it's not downloading the model repeatedly by mounting in your HF cache dir):
 