InferBench
🥇
17
A cost/quality/speed Leaderboard for Inference Providers!
Meaningful leaderboards showcasing LLM evaluation results across various tasks and dimensions
A cost/quality/speed Leaderboard for Inference Providers!
Embedding Leaderboard
Track, rank and evaluate open LLMs and chatbots
Compare and rank AI model performance
Evaluate open LLMs in the languages of LATAM and Spain.
Compare and rank AI models through human voting
Compare and find the best LLM performance on different hardware configurations
Compare and rank visual document retrieval models across different benchmarks
VLMEvalKit Evaluation Results Collection
Submit model evaluation results to leaderboard
Submit and evaluate model results on MM-UPD benchmarks
Explore MMBench Leaderboard data