Play chess between two AI models
OmniParser: turn your LLM into a GUI agent
Track, rank and evaluate open LLMs and chatbots