Darwin Family: MRI-Trust-Weighted Evolutionary Merging for Training-Free Scaling of Language-Model Reasoning Paper • 2605.14386 • Published 2 days ago • 45
Article Introducing WM Bench: A Benchmark for Cognitive Intelligence in World Models FINAL-Bench • Mar 29 • 13
Article Smol AI WorldCup: A 5-Axis Benchmark That Reveals What Small Language Models Can Really Do FINAL-Bench • Mar 10 • 38