How to use kernels-community/flash-attn2 with Kernels:

    # !pip install kernels
    from kernels import get_kernel

    kernel = get_kernel("kernels-community/flash-attn2")
fix: add warning for accuracy issue
README.md
@@ -4,7 +4,10 @@ tags:
 - kernel
 ---
 
-
+<!--  -->
+
+> [!WARNING]
+> The latest build b58ed97 may contain an accuracy issue, which is currently being addressed. Please use with caution, and be aware that corrected outputs will be available soon.
 
 # Flash Attention
 
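One way to act on the accuracy warning is to compare a kernel's attention output against a slow reference before relying on a given build. The sketch below is a pure-Python, single-head, non-causal reference with no masking or dropout. The names `reference_attention`, `max_abs_diff`, and `looks_accurate`, and the `atol=1e-2` default, are illustrative assumptions; none of them are part of the `kernels` library or of this repository, and the sketch does not call the kernel itself.

```python
import math


def reference_attention(q, k, v):
    """Naive softmax(q @ k^T / sqrt(d)) @ v for one head, on nested lists.

    q has shape [n_queries][d], k has shape [n_keys][d], v has shape [n_keys][d_v].
    """
    d = len(q[0])
    scale = 1.0 / math.sqrt(d)
    out = []
    for qi in q:
        # Scaled dot product of this query with every key.
        scores = [scale * sum(a * b for a, b in zip(qi, kj)) for kj in k]
        # Subtract the row maximum before exponentiating for numerical stability.
        m = max(scores)
        weights = [math.exp(s - m) for s in scores]
        total = sum(weights)
        # Weighted average of the value rows.
        out.append([
            sum(w * vj[c] for w, vj in zip(weights, v)) / total
            for c in range(len(v[0]))
        ])
    return out


def max_abs_diff(a, b):
    """Largest element-wise absolute difference between two 2-D nested lists."""
    return max(abs(x - y) for ra, rb in zip(a, b) for x, y in zip(ra, rb))


def looks_accurate(candidate, reference, atol=1e-2):
    """True if candidate matches reference within atol (an assumed tolerance)."""
    return max_abs_diff(candidate, reference) <= atol
```

To use it, convert the kernel's output for a small input to nested lists (for a PyTorch tensor, `.float().cpu().tolist()`) and pass it as `candidate`. The appropriate tolerance depends on the dtype, so treat the default as a starting point rather than a recommended threshold.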