Arena Leaderboard
View the LMArena language model leaderboard
View the LMArena language model leaderboard
Track, rank and evaluate open LLMs and chatbots
Embedding Leaderboard
Explore speech recognition model benchmarks and rankings
Explore LLM performance across hardware configurations
Explore and submit code model evaluations on a leaderboard
View and submit LLM evaluations
Explore and submit LLM benchmarks
Explore AI-powered visual tasks in Vision Arena
Evaluate LLMs' cybersecurity risks and capabilities
View the latest LLM performance leaderboard online
Explore and compare QA and long doc benchmarks
VLMEvalKit Evaluation Results Collection
Explore RewardBench model rankings and scores
Explore code-generation model leaderboards and task details
Display and filter multimodal model leaderboard results
Display MTEB Arena interface
Visualize Open vs. Proprietary LLM Progress
View and compare openβsource AI model rankings with ELO scores
Blind vote on HF TTS models!
A leaderboard for LLMs powering smolagents