Add MDPBench evaluation results

#13
by Delores-Lin - opened

Adds MDPBench benchmark results for the official Hugging Face leaderboard.

Ready to merge
This branch is ready to get merged automatically.

Sign up or log in to comment