Running Agents 69 UncheatableEval π 69 Explore model scaling metrics with interactive tables and plots
Meta-Harness: End-to-End Optimization of Model Harnesses Paper β’ 2603.28052 β’ Published Mar 30 β’ 20
Flash-KMeans: Fast and Memory-Efficient Exact K-Means Paper β’ 2603.09229 β’ Published Mar 10 β’ 82