Running on CPU Upgrade Agents 604 GAIA Leaderboard π¦Ύ 604 Submit and score your model on the GAIA benchmark
Running on CPU Upgrade 195 LLM Hallucination Leaderboard π 195 View and filter LLM hallucination leaderboard
Jofthomas/hermes-function-calling-thinking-V1 Viewer β’ Updated Feb 16, 2025 β’ 3.57k β’ 620 β’ 77
Running 3.83k The Ultra-Scale Playbook π 3.83k The ultimate guide to training LLM on large GPU Clusters
Running Agents 80 AI Energy Score Leaderboard π 80 Explore AI energy efficiency across various tasks
meta-llama/Llama-3.3-70B-Instruct Text Generation β’ 71B β’ Updated Dec 21, 2024 β’ 827k β’ β’ 2.75k
Running Agents 111 Judge Arena π» 111 View and compare openβsource AI model rankings with ELO scores
Running Featured 595 Image Arena Leaderboard π 595 Image Generation and Image Editing Arena & Leaderboard
meta-llama/Llama-3.1-8B-Instruct Text Generation β’ 8B β’ Updated Sep 25, 2024 β’ 9.58M β’ β’ 5.8k
meta-llama/Llama-3.1-405B-Instruct Text Generation β’ 406B β’ Updated Sep 25, 2024 β’ 246k β’ 595
meta-llama/Llama-3.1-70B-Instruct Text Generation β’ 71B β’ Updated Dec 15, 2024 β’ 733k β’ β’ 910