Spaces: SmartHeal/SmartHeal-Agentic-AI

SmartHeal committed on Aug 18

Commit 2b90bf7 (verified) · 1 parent: e4c8717

Update src/ai_processor.py

Files changed (1)

src/ai_processor.py

@@ -149,7 +149,7 @@ def _vlm_infer_gpu(messages, model_id: str, max_new_tokens: int, token: Optional
         task="image-text-to-text",
         model=model_id,
         torch_dtype=torch.bfloat16, # Use torch_dtype from the working example
-        device_map="auto",            # CUDA init happens here, safely in GPU worker
+        device_map=0,            # CUDA init happens here, safely in GPU worker
         token=token,
         trust_remote_code=True,
         model_kwargs={"low_cpu_mem_usage": True},
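
Note on the change: `device_map="auto"` lets Accelerate decide where to place the model's layers (possibly spreading them across several GPUs, CPU, or disk), whereas an integer `device_map=0` asks for the whole model to be placed on CUDA device 0. The sketch below is only an illustration of the new arguments, assuming the enclosing call is `transformers.pipeline` (the hunk shows the arguments but not the call itself). The helper names `build_vlm_pipeline_kwargs` and `load_vlm_pipeline` are hypothetical and do not appear in the commit.

```python
from typing import Any, Dict, Optional


def build_vlm_pipeline_kwargs(
    model_id: str,
    token: Optional[str] = None,
    gpu_index: int = 0,
) -> Dict[str, Any]:
    """Collect the keyword arguments from the diff (hypothetical helper)."""
    return {
        "task": "image-text-to-text",
        "model": model_id,
        # String form of the dtype; the commit passes torch.bfloat16 directly.
        "torch_dtype": "bfloat16",
        # An integer places the whole model on that CUDA device,
        # instead of letting "auto" spread layers across devices.
        "device_map": gpu_index,
        "token": token,
        "trust_remote_code": True,
        "model_kwargs": {"low_cpu_mem_usage": True},
    }


def load_vlm_pipeline(model_id: str, token: Optional[str] = None):
    """Build the pipeline; assumes transformers and a CUDA GPU are available."""
    # Imported lazily so that CUDA is only initialised inside the GPU worker.
    from transformers import pipeline

    return pipeline(**build_vlm_pipeline_kwargs(model_id, token))
```

Calling `load_vlm_pipeline(model_id, token)` inside the GPU worker would then mirror the post-commit configuration, under the assumptions stated above.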