| --- |
| license: cc-by-nc-nd-4.0 |
| tags: |
| - medical-imaging |
| - ct |
| - vision |
| - vision-language |
| --- |
| |
| # FlexiCT |
|
|
| FlexiCT is a CT foundation model family trained through agglomerative continual pretraining from 2D slice-level anatomy to 3D volumetric reasoning and report-aligned vision-language understanding. |
|
|
| This family page links three child repos: |
|
|
| | Repo | Input | Output | Recommended use | |
| |---|---|---|---| |
| | `ricklisz/FlexiCT-2D` | `[B, 1, 512, 512]` CT slices | CLS and patch tokens | Slice-level feature extraction, classification, visualization, registration features | |
| | `ricklisz/FlexiCT-3D` | `[B, 1, 160, 160, 160]` CT volumes | CLS and patch tokens | Whole-volume feature extraction and downstream 3D workflows | |
| | `ricklisz/FlexiCT-3D-VLM` | CT volumes plus text | Image/text embeddings and similarity scores | Report-aligned retrieval and zero-shot text-image scoring | |
|
|
| ## Preprocessing presets |
|
|
| `default` is recommended for whole-volume 3D and 3D-VLM inference. It orients/resamples path inputs to LPS at 2 mm spacing when spacing is available, clips HU to `[-1000, 1000]`, z-score normalizes, pads with the tensor minimum to at least `160^3`, then center crops to `160^3`. This best matches the released VLM evaluation path because it preserves physical scale better than globally resizing the anatomy. |
|
|
| `local_path` is a forgiving demo preset for arbitrary local CT files. It orients/resamples path inputs, clips, normalizes, pads to a cube, then trilinear-resizes to `160^3`. It is robust to heterogeneous scans but less faithful to the VLM inference scripts because it globally rescales anatomy. |
|
|
| `retrieval_roi` is retrieval-specific. It orients/resamples, clips, normalizes, crops an ROI cube from coordinates, a mask, or a bounding box, pads if the crop hits an image boundary, then resizes to `160^3`. |
|
|
| ## License |
|
|
| The released checkpoints are made available under the Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License. Code is MIT licensed in the source repository. Users must also comply with licenses and usage terms for the original datasets used to train the models. |
|
|
| ## Citation |
|
|
| ```bibtex |
| @misc{li2026universalctrepresentations, |
| title = {Universal CT Representations from Anatomy to Disease Phenotype through Agglomerative Pretraining}, |
| author = {Yuheng Li and Yuan Gao and Haoyu Dong and Yuxiang Lai and Shansong Wang and Mojtaba Safari and James E. Baciak and Xiaofeng Yang}, |
| year = {2026}, |
| eprint = {2605.21906}, |
| archivePrefix = {arXiv}, |
| primaryClass = {cs.CV}, |
| doi = {10.48550/arXiv.2605.21906}, |
| url = {https://arxiv.org/abs/2605.21906} |
| } |
| ``` |
|
|
| ## Medical disclaimer |
|
|
| FlexiCT is for research use only. It is not a medical device and is not a substitute for professional medical judgment. |
|
|