Kernels
sigmoid-neuron
feat: implement Flash Attention 1 using Triton with PyTorch C++ bindings and test suite
8cf888e