tags: - kernels
This CUDA extension implements fused dropout + residual + LayerNorm from the flash-attention repo.