Instruction-Following Evaluation for Large Language Models Paper • 2311.07911 • Published Nov 14, 2023 • 22
A Survey on Evaluation of Large Language Models Paper • 2307.03109 • Published Jul 6, 2023 • 43
Runtime error Agents Featured 435 Open Medical-LLM Leaderboard 🥇 435 Explore and submit models for benchmarking
Restarting on CPU Upgrade Agents 76 La Leaderboard 🌸 76 Evaluate open LLMs in the languages of LATAM and Spain.