Spaces:
Runtime error
Runtime error
| xFormers optimized operators | |
| ============================================================ | |
| Memory-efficient attention | |
| --------------------------- | |
| .. automodule:: xformers.ops | |
| :members: memory_efficient_attention, AttentionOpBase | |
| :show-inheritance: | |
| :imported-members: | |
| Available implementations | |
| ~~~~~~~~~~~~~~~~~~~~~~~~~~~ | |
| .. automodule:: xformers.ops.fmha.cutlass | |
| :members: FwOp, BwOp | |
| :member-order: bysource | |
| .. automodule:: xformers.ops.fmha.flash | |
| :members: FwOp, BwOp | |
| :member-order: bysource | |
| .. automodule:: xformers.ops.fmha.small_k | |
| :members: FwOp, BwOp | |
| :member-order: bysource | |
| .. automodule:: xformers.ops.fmha.ck | |
| :members: FwOp, BwOp | |
| :member-order: bysource | |
| .. automodule:: xformers.ops.fmha.ck_decoder | |
| :members: FwOp | |
| :member-order: bysource | |
| .. automodule:: xformers.ops.fmha.ck_splitk | |
| :members: FwOp | |
| :member-order: bysource | |
| Attention biases | |
| ~~~~~~~~~~~~~~~~~~~~ | |
| .. automodule:: xformers.ops.fmha.attn_bias | |
| :members: | |
| :show-inheritance: | |
| :member-order: bysource | |
| Partial Attention | |
| ~~~~~~~~~~~~~~~~~~~~ | |
| .. automodule:: xformers.ops.fmha | |
| :members: memory_efficient_attention_partial, merge_attentions | |
| :member-order: bysource | |
| Non-autograd implementations | |
| ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | |
| .. automodule:: xformers.ops.fmha | |
| :members: memory_efficient_attention_forward, memory_efficient_attention_forward_requires_grad, memory_efficient_attention_backward | |
| :show-inheritance: | |
| :imported-members: | |
| :member-order: bysource | |