Spaces:

jpeper
/

LudoBench_test

Running

Upload folder using huggingface_hub

c9a41bc verified 3 days ago

526 Bytes

title: 'LudoBench: Board Game Reasoning Benchmark'
emoji: 🎲
colorFrom: blue
colorTo: purple
sdk: static
pinned: false
license: mit

LudoBench

A multimodal board-game reasoning benchmark evaluating LLM/VLM reasoning across 5 strategy games and 3 difficulty tiers.