adasdadsd
/

smallq-flash-attention-ascend

flash-attention

speculative-decoding

Model card Files Files and versions

smallq-flash-attention-ascend / tests

11.8 kB

Ctrl+K

Ctrl+K

1 contributor

History: 14 commits

adasdadsd's picture

Replace with cleaned vector-only implementation

86d4a3b verified about 2 months ago

CMakeLists.txt

708 Bytes
v2.0: Full float32 precision pipeline, headDim=64 support, 448/448 production tests passed about 2 months ago
run_model_tests.py

5.41 kB
Replace with cleaned vector-only implementation about 2 months ago
test_aclnn.cpp

5.65 kB
Replace with cleaned vector-only implementation about 2 months ago