---
language:
- en
---

| Prebuilt Wheels | Python Version | PyTorch Version | CUDA Version | Source |
|---|---|---|---|---|
| [Flash-Attention 2.7.4.post1](https://huggingface.co/lym00/win_amd64_prebuilt_wheels/blob/main/flash_attn-2.7.4.post1-cp312-cp312-win_amd64.whl) | 3.12 | 2.8.0.dev | 12.8.1 | [Dao-AILab/flash-attention](https://github.com/Dao-AILab/flash-attention) |
| [SageAttention 2.2.0](https://huggingface.co/lym00/win_amd64_prebuilt_wheels/blob/main/sageattention-2.2.0-cp312-cp312-win_amd64.whl) | 3.12 | 2.9.0.dev | 12.9.1 | [thu-ml/SageAttention](https://github.com/thu-ml/SageAttention) or [jt-zhang/SageAttention2_plus](https://huggingface.co/jt-zhang/SageAttention2_plus) |
| SageAttention3 (pending approval) | 3.12 | 2.9.0.dev | 12.9.1 | [jt-zhang/SageAttention3](https://huggingface.co/jt-zhang/SageAttention3) |
| Flash-Attention 2.8.1 | 3.12 | 2.9.0.dev | 12.9.1 | [Dao-AILab/flash-attention](https://github.com/Dao-AILab/flash-attention) |
| xformers 0.0.31.post1 | 3.12 | 2.9.0.dev | 12.9.1 | [facebookresearch/xformers](https://github.com/facebookresearch/xformers) |
| INSERT | INSERT | INSERT | INSERT | INSERT |
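
Each wheel is compiled against the exact Python, PyTorch, and CUDA combination shown in its row, so a mismatched environment will typically fail at import time. A minimal sketch for checking the local environment before installing (assuming PyTorch is already installed; `torch.version.cuda` prints `None` on CPU-only builds):

```python
import sys
import torch

# Compare these values against the wheel's row in the table above.
print(f"Python : {sys.version_info.major}.{sys.version_info.minor}")  # e.g. 3.12
print(f"PyTorch: {torch.__version__}")   # e.g. 2.8.0.dev...
print(f"CUDA   : {torch.version.cuda}")  # CUDA version this PyTorch build targets
```

Once the versions match, the downloaded wheel installs directly, e.g. `pip install flash_attn-2.7.4.post1-cp312-cp312-win_amd64.whl`.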