lukealonso/MiniMax-M2.1-NVFP4

8-bit precision


lukealonso committed 18 days ago

Commit 0abca8e · verified · 1 Parent(s): 55443c9

Update README.md

Files changed (1)

README.md +1 -3


@@ -5,9 +5,7 @@ base_model:
 modelopt NVFP4 quantized MiniMax-M2.1
-Not yet extensively tested, but does appear to work fine on 2x and 4x RTX 6000 Pro Blackwell via vLLM.
-If you see "No available shared memory broadcast block found in 60 seconds.", be patient.
+Works fine on 2x and 4x RTX 6000 Pro Blackwell via vLLM.
 Sample docker run (you will want to change this so it's not downloading the model repeatedly by mounting in your HF cache dir):
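
The cache-mount advice above can be sketched roughly as follows. This is not the README's own command: the image tag `vllm/vllm-openai:latest`, the port, and the host cache path are assumptions for illustration, and any extra flags the model may need (for example `--trust-remote-code`) are left out. The script only builds and prints the command so it can be reviewed before running.

```shell
# Assumption: the host keeps its Hugging Face cache in the default location.
HF_CACHE="${HF_HOME:-$HOME/.cache/huggingface}"
MODEL="lukealonso/MiniMax-M2.1-NVFP4"
TP_SIZE=2   # the README reports 2x and 4x GPU setups

# Mounting the host cache into the container means the weights are
# downloaded once and reused on later runs.
# Assumption: vllm/vllm-openai:latest is a suitable image for this model.
DOCKER_CMD="docker run --gpus all --ipc=host -p 8000:8000 \
-v ${HF_CACHE}:/root/.cache/huggingface \
vllm/vllm-openai:latest \
--model ${MODEL} --tensor-parallel-size ${TP_SIZE}"

# Print the command for inspection instead of executing it directly.
echo "$DOCKER_CMD"
```

Set `TP_SIZE=4` for a four-GPU machine. If the host cache path contains spaces, quote the `-v` argument when running the printed command.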