WildBench-V1-legacy / _header.md
yuchenlin's picture
gradio space
f777be0
|
raw
history blame
330 Bytes

🦁 WildBench: Benchmarking LLMs with Challenging Tasks from Real Users in the Wild

⚙️ GitHub | 🤗 HuggingFace | 💬 Discussions