Running on CPU Upgrade 241 MMLU-Pro Leaderboard ๐ฅ 241 More advanced and challenging multi-task evaluation
Running on CPU Upgrade 13.9k Open LLM Leaderboard ๐ 13.9k Track, rank and evaluate open LLMs and chatbots