FlexiCT / README.md
ricklisz123's picture
Upload folder using huggingface_hub
2e3a513 verified
---
license: cc-by-nc-nd-4.0
tags:
- medical-imaging
- ct
- vision
- vision-language
---
# FlexiCT
FlexiCT is a CT foundation model family trained through agglomerative continual pretraining from 2D slice-level anatomy to 3D volumetric reasoning and report-aligned vision-language understanding.
This family page links three child repos:
| Repo | Input | Output | Recommended use |
|---|---|---|---|
| `ricklisz/FlexiCT-2D` | `[B, 1, 512, 512]` CT slices | CLS and patch tokens | Slice-level feature extraction, classification, visualization, registration features |
| `ricklisz/FlexiCT-3D` | `[B, 1, 160, 160, 160]` CT volumes | CLS and patch tokens | Whole-volume feature extraction and downstream 3D workflows |
| `ricklisz/FlexiCT-3D-VLM` | CT volumes plus text | Image/text embeddings and similarity scores | Report-aligned retrieval and zero-shot text-image scoring |
## Preprocessing presets
`default` is recommended for whole-volume 3D and 3D-VLM inference. It orients/resamples path inputs to LPS at 2 mm spacing when spacing is available, clips HU to `[-1000, 1000]`, z-score normalizes, pads with the tensor minimum to at least `160^3`, then center crops to `160^3`. This best matches the released VLM evaluation path because it preserves physical scale better than globally resizing the anatomy.
`local_path` is a forgiving demo preset for arbitrary local CT files. It orients/resamples path inputs, clips, normalizes, pads to a cube, then trilinear-resizes to `160^3`. It is robust to heterogeneous scans but less faithful to the VLM inference scripts because it globally rescales anatomy.
`retrieval_roi` is retrieval-specific. It orients/resamples, clips, normalizes, crops an ROI cube from coordinates, a mask, or a bounding box, pads if the crop hits an image boundary, then resizes to `160^3`.
## License
The released checkpoints are made available under the Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License. Code is MIT licensed in the source repository. Users must also comply with licenses and usage terms for the original datasets used to train the models.
## Citation
```bibtex
@misc{li2026universalctrepresentations,
title = {Universal CT Representations from Anatomy to Disease Phenotype through Agglomerative Pretraining},
author = {Yuheng Li and Yuan Gao and Haoyu Dong and Yuxiang Lai and Shansong Wang and Mojtaba Safari and James E. Baciak and Xiaofeng Yang},
year = {2026},
eprint = {2605.21906},
archivePrefix = {arXiv},
primaryClass = {cs.CV},
doi = {10.48550/arXiv.2605.21906},
url = {https://arxiv.org/abs/2605.21906}
}
```
## Medical disclaimer
FlexiCT is for research use only. It is not a medical device and is not a substitute for professional medical judgment.