Commit History
Allow device auto map (#8) c41d17d
Truncate to 8k by default (#5) 43f3955
Set max length to 2B 619ca8d
Update README.md a9db862
Update README.md f838124
Allow pytorch<2 to use without passing attn_implementation flag (#4) b5794c5
chore: update from afe81ca705ca1a5bd6b7d90548fcac068850b2af 344bcbc
Team Finetuner committed on
Remove triton flash implementation 5ee2c37
Delete flash_attn_triton.py 4fa2261
chore: update from 896c12d73073854c513200fb74a4887cf25b2b97 96e9a75
Team Finetuner committed on
chore: update from f36c08c8a58c21b5aaab523fa03fb4a24b475612 3e3ced0
Team Finetuner committed on
chore: update from 07ce15d58b77559fce77ea89e92d398f28663bd9 0f4070e
Team Finetuner committed on
feat: allow changing flash implementation 43b8513
Jackmin801 committed on
allow math kernel bc43a5e
Jackmin801 committed on