MiniMax-M2.7 / .eval_results /swe-bench_pro.yaml
nielsr's picture
nielsr HF Staff
Add community evaluation results for SWE-BENCH_PRO, TERMINAL-BENCH-2.0
91ee1ff verified
raw
history blame
168 Bytes
- dataset:
id: ScaleAI/SWE-bench_Pro
task_id: SWE_Bench_Pro
value: 56.2
source:
url: https://huggingface.co/MiniMaxAI/MiniMax-M2.7
name: Model Card