Precompute eval matrices for multi-metric + per-slice leaderboards 553b175 evijit HF Staff Claude Opus 4.7 (1M context) commited on 17 days ago