affine-win-13 / evaluation /eval /get_scores_gpqa.py

Commit History

updated models (2026-01-06)
ebe2ddc
verified

affineawe commited on