Use shared kernels-test-utils and set metal3.1 compatibility 2b765de robtaylor-chipflow committed 4 days ago
Add Metal CI and point kernel-builder to ChipFlow fork dd92cff robtaylor-chipflow committed 10 days ago
Improve fp32 precision in rotary embedding Metal kernel 3e008a3 robtaylor-chipflow committed 10 days ago
Fix MPS command encoder lifecycle for sequential kernel calls 44d8fa4 robtaylor-chipflow committed 10 days ago
Fix test reference: use scalars to avoid PyTorch view aliasing bug 0bee974 robtaylor-chipflow committed 11 days ago
Migrate to embedded metallib and update flake.lock ae7295b robtaylor-chipflow committed 11 days ago
Fix test: capture CPU reference tensors before comparison cf2c51f robtaylor-chipflow committed 12 days ago
Add Metal rotary embedding kernel matching vLLM interface 949658a robtaylor-chipflow committed 12 days ago