Spaces:
Paused
Paused
fix: Load NV-Reason-CXR without dtype parameter to avoid JSON error
Browse files
The 'dtype' parameter in from_pretrained() causes JSON serialization
errors in our environment. Instead, load the model with default settings,
then manually convert to bfloat16 after loading.
This approach achieves the same result as NVIDIA's demo (bfloat16 model)
but avoids the JSON serialization TypeError.
🤖 Generated with [Claude Code](https://claude.com/claude-code)
Co-Authored-By: Claude <noreply@anthropic.com>
medrax/tools/nv_reason_cxr.py
CHANGED
@@ -72,12 +72,13 @@ class NVReasonCXRTool(BaseTool):

Before:

 72         print(f"Using device: {self.device}")
 73         print("Following NVIDIA's exact loading pattern from official demo")
 74
 75 -       # [removed comment — original text not captured in this extract]
 76 -       # [removed comment — original text not captured in this extract]
 77         self.model = AutoModelForImageTextToText.from_pretrained(
 78 -           [removed argument — original text not captured in this extract]
 79 -           [removed argument — original text not captured in this extract]
 80 -       ).eval().to(self.device)
 81
 82         self.processor = AutoProcessor.from_pretrained(
 83             model_path,

After:

 72         print(f"Using device: {self.device}")
 73         print("Following NVIDIA's exact loading pattern from official demo")
 74
 75 +       # NVIDIA's approach but adapted for our environment
 76 +       # The 'dtype' parameter works in their Gradio Space environment
 77 +       # but causes JSON serialization issues in our setup.
 78 +       # Solution: Load with default dtype, then convert to bfloat16 manually
 79         self.model = AutoModelForImageTextToText.from_pretrained(
 80 +           model_path,
 81 +       ).to(dtype=torch.bfloat16).eval().to(self.device)
 82
 83         self.processor = AutoProcessor.from_pretrained(
 84             model_path,