MiniMax-M2.7-MXFP4 / README.md
ColinZ22's picture
Create README.md
f1c5e1e verified
|
raw
history blame
1.67 kB
metadata
base_model:
  - MiniMaxAI/MiniMax-M2.7
language:
  - en
library_name: transformers
license: other
license_name: modified-mit
license_link: https://huggingface.co/MiniMaxAI/MiniMax-M2.7/blob/main/LICENSE

Model Overview

  • Model Architecture: MiniMaxM2ForCausalLM
    • Input: Text
    • Output: Text
  • Supported Hardware Microarchitecture: AMD MI300 MI350/MI355
  • ROCm: ---
  • PyTorch: ---
  • Transformers: ---
  • Operating System(s): Linux
  • Inference Engine: SGLang/vLLM
  • Model Optimizer: AMD-Quark
    • Weight quantization: OCP MXFP4, Static
    • Activation quantization: OCP MXFP4, Dynamic

Model Quantization

The model was quantized from MiniMaxAI/MiniMax-M2.7 using AMD-Quark. The weights are quantized to MXFP4 and activations are quantized to MXFP4.

Quantization scripts: TBD

For further details or issues, please refer to the AMD-Quark documentation or contact the respective developers.

Evaluation

TBD

Accuracy

Benchmark MiniMaxAI/MiniMax-M2.7 amd/MiniMax-M2.7-MXFP4(this model) Recovery
gsm8k (flexible-extract) TBD TBD TBD

Reproduction

TBD

License

Modifications Copyright(c) 2026 Advanced Micro Devices, Inc. All rights reserved.