InferBench
A cost/quality/speed Leaderboard for Inference Providers!
Meaningful leaderboards showcasing LLM evaluation results across various tasks and dimensions
Embedding Leaderboard
Track, rank and evaluate open LLMs and chatbots
View the LMArena leaderboard in full-screen
Evaluate open LLMs in the languages of LATAM and Spain.
View and compare open-source AI model rankings with ELO scores
Compare LLM hardware performance and find the best model
Browse and compare visual document retrieval model scores
VLMEvalKit Evaluation Results Collection
Submit model evaluation results to leaderboard
Submit and evaluate model results on MM-UPD benchmarks
Explore MMBench Leaderboard data