Kernels
flash-attention-1-triton / flash_attention_kernel
14.2 kB
sigmoid-neuron's picture
feat: add support for ROCm backend and update device validation for HIP and XPU compatibility
a391959