ChibuUkachi commited on
Commit
2ff2937
·
verified ·
1 Parent(s): 99c7ddf

update results

Browse files
Files changed (1) hide show
  1. README.md +7 -7
README.md CHANGED
@@ -253,11 +253,11 @@ The model was evaluated on the ifeval, mmlu_pro and gsm8k_platinum using [lm-ev
253
 
254
  ### Accuracy
255
 
256
- | Benchmark | inference-optimization/MiniMax-M2.5-BF16 | inference-optimization/MiniMax-M2.5-NVFP4 | Recovery (%) |
257
- |-----------|------------------------------------------|-------------------------------------------|--------------|
258
- | GSM8k Platinum (0-shot) | 95.15 | 93.91 | 98.70 |
259
- | IfEval (0-shot) | 88.17 | 85.40 | 96.86 |
260
- | AIME 2025 | 87.50 | 77.08 | 88.10 |
261
- | GPQA diamond | 83.67 | 80.30 | 95.98 |
262
- | Math 500 | 87.33 | 87.73 | 100.46 |
263
  | MMLU Pro Chat | 80.83 | 80.08 | 99.07 |
 
253
 
254
  ### Accuracy
255
 
256
+ | Benchmark | inference-optimization/MiniMax-M2.5-BF16 | inference-optimization/MiniMax-M2.5-NVFP4 | Recovery (%) |
257
+ |-----------|------------------------------------------|-------------------------------------------|--------------|
258
+ | GSM8k Platinum (0-shot) | 95.15 | 93.91 | 98.70 |
259
+ | IfEval (0-shot) | 92.05 | 89.89 | 97.66 |
260
+ | AIME 2025 | 87.50 | 77.08 | 88.10 |
261
+ | GPQA diamond | 83.67 | 80.30 | 95.98 |
262
+ | Math 500 | 87.33 | 87.73 | 100.46 |
263
  | MMLU Pro Chat | 80.83 | 80.08 | 99.07 |