Looking forward to the quantized versions

#1
by Qnibbles - opened

Quantized GGUF versions would be good, but something that works well on vLLM would be even better for me, especially NVFP4 to maximize performance on a Blackwell RTX PRO. Thanks for the REAPs.

No problem... however, the model might have mismatched tensor sizes or something similar in that regard. I'll test loading all the variants and then see if I can fix it if they do have a problem.

This could be due to MiniMax shipping the model in native FP8, or to the pruning process itself.
