fix: add linux_aarch64 flash-attn wheel to enable Arm64 builds

#14

The existing linux_x86_64 flash-attn wheel URL (line 32) prevents this Space from building on Arm64 hardware (AWS Graviton, Apple Silicon, NVIDIA Jetson).

This PR adds the equivalent linux_aarch64 wheel from the same release (v0.7.12) alongside the existing x86_64 entry, conditioned on platform_machine == 'aarch64'.

Discovered and validated using Docker MCP Toolkit + Arm MCP Server scan.

Tested wheel: flash_attn-2.8.3+cu128torch2.10-cp311-cp311-linux_aarch64.whl
Source: https://github.com/mjun0812/flash-attention-prebuild-wheels/releases/tag/v0.7.12

Ready to merge
This branch is ready to get merged automatically.

Sign up or log in to comment