Visualize Dataset (v2.0+ latest dataset format): Visualize LeRobot datasets with interactive charts and tools. Status: Running on CPU Upgrade. Likes: 551.
Open LLM Progress Tracker: Visualize Open vs. Proprietary LLM Progress. Status: Runtime error. Tags: Agents, Featured. Likes: 151.
MMLU-Pro Leaderboard: More advanced and challenging multi-task evaluation. Status: Running on CPU Upgrade. Tags: Agents. Likes: 251.
Open LLM Leaderboard: Track, rank and evaluate open LLMs and chatbots. Status: Running on CPU Upgrade. Likes: 14k.