view article Article "The Child That Surpassed Both Parents Through MRI-Guided Evolutionary Merge" 11 days ago β’ 14
view article Article Introducing WM Bench: A Benchmark for Cognitive Intelligence in World Models 13 days ago β’ 13
view article Article ποΈ Smol AI WorldCup: A 5-Axis Benchmark That Reveals What Small Language Models Can Really Do Mar 10 β’ 38
view article Article MARL: Runtime Middleware That Reduces LLM Hallucination Without Fine-Tuning Mar 9 β’ 15
view article Article Structural Problems in AI Benchmarking and the Case for a Unified Evaluation Framework Mar 8 β’ 12
view article Article Do Bubbles Form When Tens of Thousands of AIs Simulate Capitalism? Feb 24 β’ 17