Add evaluation results (HLE, HMMT, MMLU-Pro, SWE-bench Verified)

#2
by SaylorTwift HF Staff - opened
Ready to merge
This branch is ready to get merged automatically.

Sign up or log in to comment