Vision Arena (Testing VLMs side-by-side): Analyze images with multiple vision models for labels and boxes. [Running, Featured; 560 likes]
Berkeley Function Calling Leaderboard: View the Berkeley Function-Calling Leaderboard. [Running; 123 likes]
Open LLM Leaderboard: Track, rank and evaluate open LLMs and chatbots. [Running on CPU Upgrade; 13.9k likes]
Big Code Models Leaderboard: Explore and submit code model evaluations on a leaderboard. [Running; 1.49k likes]