Post
457
Just published my first cuda kernel, inspired by Sage Attention. Feel free to try it out ☺️
AINovice2005/attention-int8
AINovice2005/attention-int8
Join the community of Machine Learners and AI enthusiasts.
Sign UpThanks for reaching out.
This is a specialized kernel for diffusion models. Typically, CUDA kernels are used for speeding up inference, distributed training of large models, etc. So, scope of the kernel can differ for each specific process.