Instructions to use kernels-community/paged-attention with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Kernels
How to use kernels-community/paged-attention with Kernels:
    # !pip install kernels
    from kernels import get_kernel
    kernel = get_kernel("kernels-community/paged-attention")
- Notebooks
- Google Colab
- Kaggle
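Once the kernel is loaded, it can help to see which functions it exposes before calling any of them. The following is a minimal sketch, not part of the repository: `load_paged_attention` and `list_public_ops` are hypothetical helper names, and the sketch assumes the object returned by `get_kernel` behaves like an ordinary Python module. It makes no assumption about the names or signatures of the kernel's functions.

```python
def load_paged_attention():
    """Fetch the kernel from the Hub (needs network access and the `kernels` package)."""
    # Imported lazily so the helpers below work even without `kernels` installed.
    from kernels import get_kernel

    return get_kernel("kernels-community/paged-attention")


def list_public_ops(module):
    """Return the sorted names of public callables found on a loaded kernel module.

    Assumption: the loaded kernel can be inspected with dir() like a normal module.
    """
    return sorted(
        name
        for name in dir(module)
        if not name.startswith("_") and callable(getattr(module, name))
    )


# Typical use (commented out because it downloads the kernel):
# kernel = load_paged_attention()
# print(list_public_ops(kernel))
```

Printing the result gives a quick inventory of entry points; the exact names depend on the kernel build that gets downloaded.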
Commit History
Update readme 16fc7e4
Add metal paged attention ed30f9d
feat: add tag for hfjob build a0903d3 verified
Build 1e0a970
Build (AArch64) 20990f8
Update flake inputs daf6221
Use default CUDA capabilities 0f86240
Fix flake input bebc17e
Build (aarch64) a9bb8f7
Build dde9676
Sync capabilities with upstream 3f98f45
Update flake cea0337
feat: update to include rev in kernel for reproducible symbols 9164b48
drbh committed on