Toolkits (DUPLICATE them, never use the public ones)
A set of tools to enable finetuning, evaluations, prototyping, agentic workflows, etc.
ATTENTION: ALWAYS DUPLICATE THESE SPACES ON OUR INFRA!!!

AutoTrain Advanced 🚀 (Running, 123 likes): Create powerful AI models without code
LLM Merge Adapter 🐢 (Runtime error, 39 likes)
mergekit-gui 🔀 (Featured, 289 likes): Merge AI models using a YAML configuration file
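
The instruction to duplicate each toolkit Space rather than use the public one could be scripted along these lines. This is a minimal sketch, not the collection author's procedure: it assumes the `huggingface_hub` package is installed and a write token is already configured, and the names `example-owner/example-toolkit`, `our-org`, `target_space_id`, and `duplicate_to_org` are hypothetical placeholders, since the listing gives no Space IDs or organization name.

```python
def target_space_id(source_id: str, org: str) -> str:
    """Map a public Space ID 'owner/name' to 'org/name' (hypothetical helper)."""
    owner, sep, name = source_id.partition("/")
    if not sep or not owner or not name:
        raise ValueError(f"expected 'owner/name', got {source_id!r}")
    return f"{org}/{name}"


def duplicate_to_org(source_id: str, org: str, private: bool = True):
    """Duplicate a public Space into `org` and return the new repo URL.

    Assumption: huggingface_hub is installed and the caller is logged in
    with a token that can create Spaces under `org`.
    """
    # Imported lazily so the pure helper above works without the dependency.
    from huggingface_hub import duplicate_space

    return duplicate_space(
        from_id=source_id,
        to_id=target_space_id(source_id, org),
        private=private,
        exist_ok=True,  # do not fail if the copy already exists
    )


if __name__ == "__main__":
    # Placeholder IDs: substitute the real Space ID and your organization.
    print(duplicate_to_org("example-owner/example-toolkit", "our-org"))
```

Keeping the ID mapping in its own function lets it be checked without network access. Secrets and hardware settings of the source Space may need to be configured separately on the copy.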

Benchmarks
Most commonly used leaderboards to check model capabilities

Open LLM Leaderboard 🏆 (Running on CPU Upgrade, 13.9k likes): Track, rank and evaluate open LLMs and chatbots
LLM Performance Leaderboard 🐨 (Running, Featured, 447 likes): View the latest LLM performance leaderboard online
Arena Leaderboard 🏆 (Running, 4.8k likes): View the LMArena leaderboard of language model rankings
MTEB Leaderboard 🥇 (Running on CPU Upgrade, 7.17k likes): Embedding Leaderboard