fev-bench / tables /domain_cloud /leaderboard_SQL.csv
shchuro's picture
Update results per group
169601b
model_name,win_rate,skill_score,median_training_time_s_per100,median_inference_time_s_per100,training_corpus_overlap,num_failures
Toto-1.0,91.78571428571428,63.05871546601058,0.0,45.868307151375,0.0,0.0
Chronos-2,91.42857142857143,63.92789671191335,0.0,1.18621670825,0.0,0.0
TimesFM-2.5,83.57142857142857,61.98663044796851,0.0,6.4447616408988475,0.0,0.0
TiRex,80.35714285714285,60.87156591539495,0.0,0.20031914146825397,0.0,0.0
Moirai-2.0,68.21428571428572,58.41610987284718,0.0,0.36445902017857146,0.1,0.0
Sundial-Base,59.64285714285715,56.647735223836015,0.0,7.874095355196429,0.0,0.0
Chronos-Bolt,58.57142857142858,54.85929055283013,0.0,0.21195593629285714,0.0,0.0
TabPFN-TS,57.50000000000001,53.05789136035014,0.0,187.1612375475248,0.0,0.0
AutoARIMA,38.03571428571428,32.58752067086065,0.0,17.96621433673913,0.0,15.0
Stat. Ensemble,34.82142857142856,20.1052436267542,0.0,117.23392411285715,0.0,15.0
AutoETS,28.035714285714285,-26.853793211441058,0.0,2.879931537857143,0.0,15.0
AutoTheta,22.142857142857146,-2.1145171067117996,0.0,3.978930596261481,0.0,0.0
Seasonal Naive,19.107142857142854,0.0,0.0,0.43848143432692305,0.0,0.0
Naive,13.571428571428573,-102.14853969808834,0.0,0.40027213541999274,0.0,0.0
Drift,3.214285714285715,-110.2431874466197,0.0,0.4187055400961539,0.0,0.0