add RAM usage with CacheDataset and GPU consumption warning

Files changed:
- README.md (+19 -1)
- configs/metadata.json (+2 -1)
- docs/README.md (+19 -1)

README.md
@@ -39,13 +39,25 @@ The segmentation of 104 tissues is formulated as voxel-wise multi-label segmentation
 
 The training was performed with the following:
 
-- GPU:
+- GPU: 48 GB of GPU memory
 - Actual Model Input: 96 x 96 x 96
 - AMP: True
 - Optimizer: AdamW
 - Learning Rate: 1e-4
 - Loss: DiceCELoss
 
+### Memory Consumption
+
+- Dataset Manager: CacheDataset
+- Data Size: 1000 3D Volumes
+- Cache Rate: 0.4
+- Single GPU - System RAM Usage: 83 GB
+- Multi GPU (8 GPUs) - System RAM Usage: 666 GB
+
+### Memory Consumption Warning
+
+If you face memory issues with CacheDataset, you can either switch to a regular Dataset class or lower the caching rate `cache_rate` in the configurations within the range $(0, 1)$ to reduce the system RAM requirements.
+
 ### Input
 
 One channel
@@ -59,6 +71,12 @@ One channel
 
 ## Resource Requirements and Latency Benchmarks
 
+### GPU Consumption Warning
+
+The model is trained with 104 classes in a single instance; when predicting all 104 structures, GPU memory consumption can be large.
+
+For the inference pipeline, please refer to the following section for benchmarking results. Typically, a CT scan with 300 slices takes about 27 GB of GPU memory; if your CT is larger, please prepare more GPU memory or use the CPU for inference.
+
 ### High-Resolution and Low-Resolution Models
 
 We retrained two versions of the totalSegmentator models, following the original paper and implementation.
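The reported RAM figures suggest cached memory grows roughly linearly with `cache_rate`, the number of volumes, and (with per-GPU worker caching) the number of GPUs: 8 GPUs use about 8x the single-GPU figure. A minimal back-of-envelope sketch, assuming that linear model and calibrating the per-volume cost from the reported 83 GB single-GPU number (the function name is illustrative, not part of the bundle):

```python
def estimate_cache_ram_gb(n_volumes, cache_rate, gb_per_volume, n_gpus=1):
    # Hypothetical linear model: each GPU worker caches its own copy of
    # cache_rate * n_volumes preprocessed volumes in system RAM.
    return n_gpus * cache_rate * n_volumes * gb_per_volume

# Calibrate per-volume cost from the reported single-GPU figure:
# 1000 volumes at cache_rate 0.4 -> 83 GB of system RAM.
gb_per_volume = 83 / (0.4 * 1000)  # about 0.21 GB per cached volume

print(round(estimate_cache_ram_gb(1000, 0.4, gb_per_volume, n_gpus=1)))  # 83
print(round(estimate_cache_ram_gb(1000, 0.4, gb_per_volume, n_gpus=8)))  # 664 (reported: 666 GB)
print(estimate_cache_ram_gb(1000, 0.2, gb_per_volume))  # ~41.5: halving cache_rate halves RAM
```

The close match between the extrapolated 8-GPU value and the reported 666 GB is what motivates lowering `cache_rate` (or switching to a plain Dataset) as the remedy when RAM is tight.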
configs/metadata.json

@@ -1,7 +1,8 @@
 {
     "schema": "https://github.com/Project-MONAI/MONAI-extra-test-data/releases/download/0.8.1/meta_schema_20220324.json",
-    "version": "0.1.5",
+    "version": "0.1.6",
     "changelog": {
+        "0.1.6": "add RAM usage with CacheDataset and GPU consumption warning",
         "0.1.5": "fix mgpu finalize issue",
         "0.1.4": "Update README Formatting",
         "0.1.3": "add non-deterministic note",
docs/README.md

@@ -32,13 +32,25 @@ The segmentation of 104 tissues is formulated as voxel-wise multi-label segmentation
 
 The training was performed with the following:
 
-- GPU:
+- GPU: 48 GB of GPU memory
 - Actual Model Input: 96 x 96 x 96
 - AMP: True
 - Optimizer: AdamW
 - Learning Rate: 1e-4
 - Loss: DiceCELoss
 
+### Memory Consumption
+
+- Dataset Manager: CacheDataset
+- Data Size: 1000 3D Volumes
+- Cache Rate: 0.4
+- Single GPU - System RAM Usage: 83 GB
+- Multi GPU (8 GPUs) - System RAM Usage: 666 GB
+
+### Memory Consumption Warning
+
+If you face memory issues with CacheDataset, you can either switch to a regular Dataset class or lower the caching rate `cache_rate` in the configurations within the range $(0, 1)$ to reduce the system RAM requirements.
+
 ### Input
 
 One channel
@@ -52,6 +64,12 @@ One channel
 
 ## Resource Requirements and Latency Benchmarks
 
+### GPU Consumption Warning
+
+The model is trained with 104 classes in a single instance; when predicting all 104 structures, GPU memory consumption can be large.
+
+For the inference pipeline, please refer to the following section for benchmarking results. Typically, a CT scan with 300 slices takes about 27 GB of GPU memory; if your CT is larger, please prepare more GPU memory or use the CPU for inference.
+
 ### High-Resolution and Low-Resolution Models
 
 We retrained two versions of the totalSegmentator models, following the original paper and implementation.
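The 300-slice / 27 GB inference benchmark added above can serve as a rough pre-flight check before choosing a device. A minimal sketch, assuming memory scales linearly with slice count (an assumption; the helper names are illustrative):

```python
def estimate_inference_gpu_gb(n_slices, gb_per_300_slices=27.0):
    # Linear extrapolation (an assumption) from the reported benchmark:
    # a CT scan with 300 slices takes about 27 GB of GPU memory.
    return gb_per_300_slices * n_slices / 300

def fits_on_gpu(n_slices, gpu_gb):
    # True if the estimated footprint fits on the available GPU;
    # otherwise fall back to CPU inference or a larger GPU.
    return estimate_inference_gpu_gb(n_slices) <= gpu_gb

print(estimate_inference_gpu_gb(300))  # 27.0
print(fits_on_gpu(300, 48.0))          # True: fits on a 48 GB GPU
print(fits_on_gpu(600, 48.0))          # False: ~54 GB estimated -> use CPU
```

Actual usage also depends on spacing, ROI size, and the sliding-window batch size, so treat the estimate as a lower bound rather than a guarantee.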