Running on CPU Upgrade 14k Open LLM Leaderboard 🏆 14k Track, rank and evaluate open LLMs and chatbots
Running Agents 68 UncheatableEval 🏆 68 Explore model scaling metrics with interactive tables and plots