Add MDPBench evaluation results

#10

Adds MDPBench benchmark results using the required Eval Result fields only.

Closing in favor of #11, which keeps source attribution on the overall leaderboard entry while avoiding the per-task source validation issue.

Delores-Lin changed pull request status to closed

Sign up or log in to comment