judge_id,judge_name,elo_score,wins,losses,total_evaluations,organization,license,parameters
qualifire-eval,Qualifire,1724.8384234654231,40.0,4.0,44.0,Qualifire,Proprietary,400M
claude-3-haiku-20240307,Claude 3 Haiku,1558.9789022015404,4.0,1.0,5.0,Anthropic,Proprietary,
claude-3-5-haiku-latest,Claude 3.5 Haiku,1553.2109613480875,3.0,0.0,3.0,Anthropic,Proprietary,
qwen-2.5-7b-instruct-turbo,Qwen 2.5 7B Instruct,1543.37554446099,3.0,0.0,3.0,Alibaba,Open Source,
meta-llama-3.1-70b-instruct-turbo,Meta Llama 3.1 70B Instruct,1535.5696544480506,6.0,3.0,9.0,Meta,Open Source,
gpt-3.5-turbo,GPT-3.5 Turbo,1530.628203437139,2.0,0.0,2.0,OpenAI,Proprietary,
claude-3-sonnet-20240229,Claude 3 Sonnet,1528.1056355333478,2.0,1.0,3.0,Anthropic,Proprietary,
meta-llama-4-scout-17B-16E-instruct,Meta Llama 4 Scout 17B 16E Instruct,1516.2892092665088,2.0,2.0,4.0,Meta,Open Source,
qwen-2.5-72b-instruct-turbo,Qwen 2.5 72B Instruct,1515.1480974364024,1.0,0.0,1.0,Alibaba,Open Source,
mistral-7b-instruct-v0.1,Mistral (7B) Instruct v0.1,1500.0,0.0,0.0,0.0,Mistral AI,Open Source,
judge5,Mixtral,1500.0,0.0,0.0,0.0,Mistral AI,Commercial,
qwen-2-72b-instruct,Qwen 2 Instruct (72B),1500.0,0.0,0.0,0.0,Alibaba,Open Source,
gpt-4-turbo,GPT-4 Turbo,1499.7217358602074,1.0,1.0,2.0,OpenAI,Proprietary,
gemma-2-27b-it,Gemma 2 27B,1484.736306793522,0.0,1.0,1.0,Google,Open Source,
claude-3-opus-latest,Claude 3 Opus,1483.8496849577325,1.0,3.0,4.0,Anthropic,Proprietary,
gpt-4o,GPT-4o,1483.5476042607663,1.0,3.0,4.0,OpenAI,Proprietary,
meta-llama-3.1-405b-instruct-turbo,Meta Llama 3.1 405B Instruct,1480.7273197431043,1.0,5.0,6.0,Meta,Open Source,
mistral-7b-instruct-v0.3,Mistral (7B) Instruct v0.3,1478.3323551088422,0.0,2.0,2.0,Mistral AI,Open Source,
claude-3-5-sonnet-latest,Claude 3.5 Sonnet,1477.6257758061242,2.0,4.0,6.0,Anthropic,Proprietary,
gpt-4.1,GPT-4.1,1468.847220765222,0.0,2.0,2.0,OpenAI,Proprietary,
deepseek-v3,DeepSeek V3,1466.4505035965371,0.0,3.0,3.0,DeepSeek,Open Source,
deepseek-r1,DeepSeek R1,1466.3355627816525,1.0,4.0,5.0,DeepSeek,Open Source,
o3-mini,o3-mini,1456.1196407049383,0.0,3.0,3.0,OpenAI,Proprietary,
meta-llama-3.3-70B-instruct-turbo,Meta Llama 4 Scout 32K Instruct,1455.5159399586212,0.0,3.0,3.0,Meta,Open Source,
meta-llama-3.1-8b-instruct-turbo,Meta Llama 3.1 8B Instruct,1448.2672724548872,0.0,6.0,6.0,Meta,Open Source,
gemma-2-9b-it,Gemma 2 9B,1322.1433222517269,3.0,28.0,31.0,Google,Open Source,