masonwang025
/

deeptfus-base

medical-imaging

focused-ultrasound

transcranial-ultrasound

Model card Files Files and versions

deeptfus-base / README.md

masonwang025's picture

upload

4bee0a6 6 days ago

|

history blame contribute delete

2.92 kB

	---
	license: cc-by-nc-nd-4.0
	library_name: pytorch
	tags:
	- medical-imaging
	- 3d-cnn
	- ultrasound
	- focused-ultrasound
	- transcranial-ultrasound
	- reproduction
	datasets:
	- vinkle-srivastav/TFUScapes
	language:
	- en
	---

	# DeepTFUS: base (run-1 reproduction)

	A reproduction attempt of DeepTFUS, proposed by [Srivastav et al. (arXiv:2505.12998)](https://arxiv.org/abs/2505.12998).

	This is the from-scratch baseline: 50 epochs on the paper recipe
	(weighted-MSE + λ·gradient-L1, no focal-position aux), `base_width=16`
	(3.4 M params), `pure-bf16`, `batch=4` at 256³ resolution. Given a 3D
	head CT and a transducer placement, predicts the resulting in-skull
	pressure field in <1 s on an H100 (≈ 50× faster than the k-Wave
	physics simulator the dataset was generated from).

	⭐ Partial reproduction: matched paper on `relative_l2`, did not match
	on `focal_position_error_mm` (~2× worse) or `max_pressure_error`. This
	gap motivated the 5 fine-tune variants in this model collection.

	## Test results (n = 597 held-out CT × placement combinations)

	\| metric \| paper \| base (this model) \| reproduced? \|
	\|---\|---:\|---:\|---\|
	\| `relative_l2` mean ± std \| 0.414 ± 0.086 \| 0.384 ± 0.078 \| ✅ Yes (slightly beats paper) \|
	\| `relative_l2` median \| 0.394 \| 0.369 \| ✅ \|
	\| `focal_position_error_mm` mean ± std \| 2.89 ± 2.14 \| 6.49 ± 4.58 \| ❌ No (~2.25× worse mean) \|
	\| `focal_position_error_mm` median \| 2.45 \| 5.15 \| ❌ \|
	\| `max_pressure_error` mean ± std \| 0.199 ± 0.158 \| 0.225 ± 0.116 \| ✅ Yes (within paper's std) \|
	\| `max_pressure_error` median \| 0.166 \| 0.217 \| (slightly above paper) \|
	\| `focal_pressure_error` median \| : \| 0.528 \| : \|
	\| `focal_iou_fwhm` median \| : \| 0.143 \| : \|
	\| `inference_latency_s` (b=1, H100) \| 11.4 (RTX 4090) \| 0.233 \| 49× faster (different HW) \|

	## Other variants and discussion

	See the [Collection](https://huggingface.co/collections/masonwang025/deeptfus-reproduction-6a03e39286a09470b960511f)
	for the 5 fine-tune variants built from this base ckpt, and the
	[project page](https://masonjwang.com/projects/reproducing-deeptfus)
	for the full reproduction story, interactive viewer, and discussion of
	trade-offs.

	## Usage

	```python
	from huggingface_hub import hf_hub_download
	import torch

	ckpt = torch.load(
	hf_hub_download("masonwang025/deeptfus-base", "ckpt_best.pt"),
	map_location="cpu", weights_only=False,
	)
	# ckpt['model'] : state_dict for the model defined in masonwang025/deeptfus repo
	# ckpt['config'] : training config (architecture knobs + train hyperparams)
	# ckpt['epoch'] : 43 (best by val_rel_l2)
	```

	Model code: [github.com/masonwang025/deeptfus](https://github.com/masonwang025/deeptfus).

	## Citation & License

	Paper: Srivastav et al., [arXiv:2505.12998](https://arxiv.org/abs/2505.12998), 2025.

	License: [CC-BY-NC-ND-4.0](https://creativecommons.org/licenses/by-nc-nd/4.0/),
	matching the TFUScapes dataset license.