view article Article ๐งโโ๏ธ "Replacing Judges with Juries" using distilabel May 3, 2024 โข 17
Replacing Judges with Juries: Evaluating LLM Generations with a Panel of Diverse Models Paper โข 2404.18796 โข Published Apr 29, 2024 โข 71
Prometheus 2: An Open Source Language Model Specialized in Evaluating Other Language Models Paper โข 2405.01535 โข Published May 2, 2024 โข 124
Open LLM Leaderboard best models โค๏ธโ๐ฅ Collection A daily uploaded list of models with best evaluations on the LLM leaderboard: โข 50 items โข Updated Mar 13 โข 683