MMLU-Pro Leaderboard: More advanced and challenging multi-task evaluation
Open LLM Leaderboard: Track, rank and evaluate open LLMs and chatbots