Running on CPU Upgrade Agents 126 Open Chinese LLM Leaderboard π 126 Explore LLM benchmark scores and submit your model for evaluation
Running on CPU Upgrade 14k Open LLM Leaderboard π 14k Track, rank and evaluate open LLMs and chatbots
Running Agents 1.51k Big Code Models Leaderboard π 1.51k Explore and compare code model performance on a leaderboard