Fix for FlashAttention RuntimeError & Triton Multi GPU fix.
#17
by Satandon1999 - opened
Fix based on the discussion here: https://huggingface.co/microsoft/Phi-3-small-8k-instruct/discussions/11
Satandon1999 changed pull request title from Update positional_embedding.py to Fix for FlashAttention RuntimeError
Satandon1999 changed pull request title from Fix for FlashAttention RuntimeError to Fix for FlashAttention RuntimeError & Triton Multi GPU fix.