OminiAbnorm-CT / README.md
zzh99's picture
Update README.md
f9448ea verified
---
license: cc-by-nc-sa-3.0
---
This repo contains the checkpoints for OminiAbnorm-CT, a multi-modal generative model for grounded abnormality analysis on CT images from multiple planes and all human body regions.
It supports three representative tasks:
- Visual prompted report generation: Interpret an abnormality marked by a red bounding box, ellipse, contour, or cropped region.
- Grounded report generation: Ground and interpret all abnormalities on the CT image.
- Text-guided grounded report generation: Detect, ground and interpret a specific abnormality on the CT image.
![Example Figure](Model.png)
It is built on **[OminiAbnorm-CT-14K](https://huggingface.co/datasets/zzh99/OminiAbnorm-CT-14K)**, the first large-scale dataset designed for abnormality grounding and description on multi-plane whole-body CT imaging.
It contains 14.5K CT images with grounding annotation for 19K abnormal findings.
Each abnormal finding is further linked to the detailed description in the report, and categorized according to a comprehensive hierarchical taxonomy.
![Example Figure](Demo.png)
Check our [paper](https://www.arxiv.org/abs/2506.03238) and [github repo](https://github.com/zhaoziheng/OminiAbnorm-CT) for usage and more details.