MiniMax-M2.7-APEX-I-Compact.gguf completely broken and outputting gibberish

by EclipseMist - opened 1 day ago

•

This quant is broken I don't have this issue with the unsloth UD IQ4_NL quant. I am running it in llama cpp vulkan so its not the cuda issue.

johandevlabs

about 14 hours ago

I have same problem - running MiniMax-M2.7-APEX-I-Mini.gguf

jdarthur

about 13 hours ago

Might be an imatrix-specific problem. I tried the Compact variant and it seemed to work, but I-Compact did not

johandevlabs

about 1 hour ago

That could be the case. Unfortunately for me, the I-Mini is the only one small enough enough to fit on my machine (Evo-X2 with 96gb ram). Speed-wise im getting really good performance (tg128 on llama-bench gives 39 token/s). I also tried the I-Mini of the Minimax M2.5 and that one works from just fine

I have tried llama.cpp compiled both with Vulkan and ROCm, both produce jibberish for M2.7, and appear to work fine for M2.5 (I-Mini).

I will keep digging and let you guys know if i find something

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment