Speed and code quality benchmarks for coding LLMs on Apple Silicon. HumanEval+ results for 21 models.