Update README.md (#1)
- Update README.md (b07548cf419fe6fa19eabc118c6cdf0275fbe9a3)
Co-authored-by: Lysandre <lysandre@users.noreply.huggingface.co>
README.md CHANGED

```diff
@@ -1,3 +1,7 @@
+---
+tags:
+- kernel
+---
 This CUDA extension implements fused dropout + residual + LayerNorm, building on
 Apex's [FastLayerNorm](https://github.com/NVIDIA/apex/tree/master/apex/contrib/layer_norm).
 Major changes:
@@ -17,4 +21,4 @@ cd csrc/layer_norm && pip install .
 
 As of 2024-01-05, this extension is no longer used in the FlashAttention repo.
 We've instead switched to a Triton-based
-[implementation](https://github.com/Dao-AILab/flash-attention/blob/main/flash_attn/ops/triton/layer_norm.py).
+[implementation](https://github.com/Dao-AILab/flash-attention/blob/main/flash_attn/ops/triton/layer_norm.py).
```
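For context on what the changed README describes: the extension fuses dropout, a residual add, and LayerNorm into a single kernel. Below is a minimal unfused PyTorch sketch of that math, for illustration only; the function name and the returned pre-norm sum are assumptions, not the extension's actual API.

```python
import torch
import torch.nn.functional as F

def dropout_add_layer_norm_ref(x, residual, weight, bias,
                               p=0.1, eps=1e-5, training=True):
    """Unfused reference for what the CUDA extension fuses into one kernel.

    Hypothetical reference helper, not the extension's API.
    """
    # 1) dropout on the incoming activations, 2) add the residual stream
    z = F.dropout(x, p=p, training=training) + residual
    # 3) LayerNorm over the hidden (last) dimension with affine params
    out = F.layer_norm(z, (z.shape[-1],), weight, bias, eps)
    # Returning z as well mirrors the common "prenorm" pattern, where the
    # pre-normalization sum feeds the next layer's residual connection.
    return out, z

x = torch.randn(2, 4, 768)
residual = torch.randn_like(x)
out, prenorm = dropout_add_layer_norm_ref(
    x, residual, torch.ones(768), torch.zeros(768)
)
```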
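The Triton replacement linked at the end of the README lives at flash_attn/ops/triton/layer_norm.py. Here is a hedged usage sketch, assuming that module exposes a `layer_norm_fn` supporting a fused residual add, dropout, and a prenorm mode; check the linked file for the actual signature, which may differ across versions.

```python
import torch
from flash_attn.ops.triton.layer_norm import layer_norm_fn  # assumed entry point

hidden = 1024
x = torch.randn(8, 512, hidden, device="cuda", dtype=torch.float16)
residual = torch.randn_like(x)
weight = torch.ones(hidden, device="cuda")
bias = torch.zeros(hidden, device="cuda")

# Fused dropout + residual-add + LayerNorm. With prenorm=True the kernel
# also returns the pre-normalization sum for the next residual connection.
# Argument names here are assumptions; verify against the linked file.
out, new_residual = layer_norm_fn(
    x, weight, bias,
    residual=residual,
    dropout_p=0.1,
    prenorm=True,
    eps=1e-5,
)
```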