Instructions to use Deep-ML/flash-attention-in-cuda-from-scratch with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Kernels
How to use Deep-ML/flash-attention-in-cuda-from-scratch with Kernels:
# !pip install kernels from kernels import get_kernel kernel = get_kernel("Deep-ML/flash-attention-in-cuda-from-scratch") - Notebooks
- Google Colab
- Kaggle
Welcome to the community
The community tab is the place to discuss and collaborate with the HF community!