MMLU-Pro Leaderboard 🥇 (238): More advanced and challenging multi-task evaluation. Running on CPU Upgrade.
Big Code Models Leaderboard 📈 (1.48k): Explore and compare code generation models on a leaderboard. Running.