view article Article "Darwin-27B-Opus: Surpassing the Foundation Model Without Training" 26 days ago ⢠13
view article Article Darwin V6: Diagnostic-Guided Evolutionary Model Merging about 1 month ago ⢠11
view article Article "The Child That Surpassed Both Parents Through MRI-Guided Evolutionary Merge" Mar 31 ⢠14
view article Article Introducing WM Bench: A Benchmark for Cognitive Intelligence in World Models Mar 29 ⢠13
view article Article MARL: Runtime Middleware That Reduces LLM Hallucination Without Fine-Tuning Mar 9 ⢠16
view article Article Structural Problems in AI Benchmarking and the Case for a Unified Evaluation Framework Mar 8 ⢠12
view article Article šļø Smol AI WorldCup: A 5-Axis Benchmark That Reveals What Small Language Models Can Really Do Mar 10 ⢠38