Kernels

Commit History

feat: add support for ROCm backend and update device validation for HIP and XPU compatibility
a391959

sigmoid-neuron committed on

feat: implement Flash Attention 1 using Triton with PyTorch C++ bindings and test suite
8cf888e

sigmoid-neuron committed on