Running 32 Polish Linguistic and Cultural Competency Benchmark 🏆 32 View a leaderboard of evaluation results
Running on CPU Upgrade Agents 77 Open PL LLM Leaderboard 🏆 77 Explore LLM benchmark leaderboard with searchable filters
Running Agents 14 Polish EQ-Bench Leaderboard 🏆 14 View model benchmark leaderboard with scores and plots
Paused Agents 28 MT Bench PL 📊 28 Przeglądaj i porównuj odpowiedzi modeli językowych w języku polskim