Spaces:

H-Liu1997
/

FloodDiffusion-Streaming

Running on T4

H-Liu1997 commited on 9 days ago

Commit

7d73321

1 Parent(s): 6c6483b

fix: move k_lens to GPU in SDPA fallback (tested locally)

Files changed (1) hide show

model_manager.py CHANGED Viewed

@@ -130,6 +130,7 @@ class ModelManager:
             "        attn_mask = None\n"
             "        is_causal_flag = causal\n"
             "        if k_lens is not None:\n"
             "            valid = torch.arange(lk, device=q.device).unsqueeze(0) < k_lens.unsqueeze(1)\n"
             "            attn_mask = torch.where(valid[:, None, None, :], 0.0, float('-inf')).to(dtype=dtype)\n"
             "            is_causal_flag = False\n"

             "        attn_mask = None\n"
             "        is_causal_flag = causal\n"
             "        if k_lens is not None:\n"
+            "            k_lens = k_lens.to(q.device)\n"
             "            valid = torch.arange(lk, device=q.device).unsqueeze(0) < k_lens.unsqueeze(1)\n"
             "            attn_mask = torch.where(valid[:, None, None, :], 0.0, float('-inf')).to(dtype=dtype)\n"
             "            is_causal_flag = False\n"