Spaces:

samwell
/

medrax2

Paused

App Files Files Community

VictorLJZ commited on Jul 9, 2025

Commit

775f52a

1 Parent(s): 707b83e

updated prompt and documentation

Browse files

Files changed (2) hide show

README.md +61 -4
medrax/tools/medsam2.py +3 -0

README.md CHANGED Viewed

@@ -22,7 +22,7 @@ MedRAX is built on a robust technical foundation:
 ### Integrated Tools
 - **Visual QA**: Utilizes CheXagent and LLaVA-Med for complex visual understanding and medical reasoning
-- **Segmentation**: Employs MedSAM and PSPNet model trained on ChestX-Det for precise anatomical structure identification
 - **Grounding**: Uses Maira-2 for localizing specific findings in medical images
 - **Report Generation**: Implements SwinV2 Transformer trained on CheXpert Plus for detailed medical reporting
 - **Disease Classification**: Leverages DenseNet-121 from TorchXRayVision for detecting 18 pathology classes
@@ -227,10 +227,17 @@ XRayVQATool(
 ```
 - CheXagent weights download automatically
-### MedSAM Tool
-```
-Support for MedSAM segmentation will be added in a future update.
 ```
 ### Python Sandbox Tool
 ```python
@@ -256,6 +263,56 @@ WebBrowserTool()  # Requires Google Search API credentials
 ## Manual Setup Required
 ### Image Generation Tool
 ```python
 ChestXRayGeneratorTool(

 ### Integrated Tools
 - **Visual QA**: Utilizes CheXagent and LLaVA-Med for complex visual understanding and medical reasoning
+- **Segmentation**: Employs MedSAM2 (advanced medical image segmentation) and PSPNet model trained on ChestX-Det for precise anatomical structure identification
 - **Grounding**: Uses Maira-2 for localizing specific findings in medical images
 - **Report Generation**: Implements SwinV2 Transformer trained on CheXpert Plus for detailed medical reporting
 - **Disease Classification**: Leverages DenseNet-121 from TorchXRayVision for detecting 18 pathology classes
 ```
 - CheXagent weights download automatically
+### MedSAM2 Tool
+```python
+MedSAM2Tool(
+    model_dir=model_dir,
+    device=device,
+    temp_dir=temp_dir
+)
 ```
+- Advanced medical image segmentation using MedSAM2 (adapted from Meta's SAM2)
+- Supports interactive prompting with box coordinates, point clicks, or automatic segmentation
+- **Requires manual setup** - see setup instructions below
 ### Python Sandbox Tool
 ```python
 ## Manual Setup Required
+### MedSAM2 Tool
+```python
+MedSAM2Tool(
+    model_dir=model_dir,
+    device=device,
+    temp_dir=temp_dir,
+    model_cfg="sam2.1_hiera_t512.yaml",  # Optional: model configuration
+    checkpoint="MedSAM2_latest.pt"       # Optional: specific checkpoint
+)
+```
+**MedSAM2 Manual Setup Instructions:**
+1. **Clone MedSAM2 Repository**:
+   ```bash
+   cd model-weights
+   git clone https://github.com/bowang-lab/MedSAM2.git
+   cd MedSAM2
+   ```
+2. **Install Dependencies**:
+   ```bash
+   # Install PyTorch with CUDA support (adjust CUDA version as needed)
+   pip install torch==2.5.1 torchvision==0.20.1 --index-url https://download.pytorch.org/whl/cu124
+   # Install MedSAM2 package
+   pip install -e ".[dev]"
+   ```
+3. **Download Model Checkpoints**:
+   ```bash
+   # Run the download script to get all MedSAM2 checkpoints
+   bash download.sh
+   ```
+   This downloads:
+   - `MedSAM2_latest.pt` (recommended) - Latest general-purpose model
+   - `MedSAM2_CTLesion.pt` - Specialized for CT lesion segmentation
+   - `MedSAM2_MRI_LiverLesion.pt` - Specialized for liver lesion MRI
+   - `MedSAM2_US_Heart.pt` - Specialized for heart ultrasound
+   - Additional EfficientTAM and SAM2 base checkpoints
+**Configuration Options:**
+- `model_cfg`: Model configuration file (default: `"sam2.1_hiera_t512.yaml"`)
+- `checkpoint`: Checkpoint file to use:
+  - `"MedSAM2_latest.pt"` - Best general-purpose model (recommended)
+  - `"MedSAM2_CTLesion.pt"` - For CT lesion segmentation
+  - `"MedSAM2_MRI_LiverLesion.pt"` - For liver MRI segmentation
+  - `"MedSAM2_US_Heart.pt"` - For heart ultrasound segmentation
 ### Image Generation Tool
 ```python
 ChestXRayGeneratorTool(

medrax/tools/medsam2.py CHANGED Viewed

@@ -50,6 +50,9 @@ class MedSAM2Tool(BaseTool):
         "Supports interactive prompting with box coordinates, point clicks, or automatic segmentation. "
         "Can handle 2D medical images and 3D volumes. Returns segmentation masks and visualization overlays. "
         "Prompt types: 'box' with [x1,y1,x2,y2] coordinates, 'point' with [x,y] coordinates, or 'auto' for automatic. "
         "Example: {'image_path': '/path/to/image.png', 'prompt_type': 'box', 'prompt_coords': [100,100,200,200]}"
     )
     args_schema: Type[BaseModel] = MedSAM2Input

         "Supports interactive prompting with box coordinates, point clicks, or automatic segmentation. "
         "Can handle 2D medical images and 3D volumes. Returns segmentation masks and visualization overlays. "
         "Prompt types: 'box' with [x1,y1,x2,y2] coordinates, 'point' with [x,y] coordinates, or 'auto' for automatic. "
+        "Don't use auto segmentation for everything, try to use point or box prompts by estimating the coordinates of the object you want to segment."
+        "Think step by step and reason about the coordinates carefully, also consider the size of the image itself and the object in the image."
+        "Also be aware of the mirroring effect (e.g. the right lung is on the left side of the image, the left lung is on the right side of the image)."
         "Example: {'image_path': '/path/to/image.png', 'prompt_type': 'box', 'prompt_coords': [100,100,200,200]}"
     )
     args_schema: Type[BaseModel] = MedSAM2Input