VELA / benchmarks
2.83 kB
intrect
data: add raw benchmark results JSON (KMMLU + HAE-RAE, 3-model comparison)
de3e104 verified