Full optimizations: mx.fast.rope, SDPA GQA, nn.relu2 ccdb8db verified dnakov commited on Oct 15, 2025
Update with optimized RMSNorm and attention (mx.fast.rms_norm + scaled_dot_product_attention) d3ab9f9 verified dnakov commited on Oct 15, 2025