🏟️ Smol AI WorldCup: A 5-Axis Benchmark That Reveals What Small Language Models Can Really Do 2 days ago • 33
Structural Problems in AI Benchmarking and the Case for a Unified Evaluation Framework 4 days ago • 12
🏟️ Smol AI WorldCup: A 5-Axis Benchmark That Reveals What Small Language Models Can Really Do 2 days ago • 33
Structural Problems in AI Benchmarking and the Case for a Unified Evaluation Framework 4 days ago • 12