arudaev
/

chexvision-scratch

Image Classification

medical-imaging

multi-label-classification

Model card Files Files and versions

chexvision-scratch / README.md

arudaev's picture

docs: add presentation deck link

b4be829 verified 6 days ago

|

history blame contribute delete

3.49 kB

	---
	license: mit
	language:
	- en
	library_name: pytorch
	pipeline_tag: image-classification
	tags:
	- chexvision
	- medical-imaging
	- chest-xray
	- radiology
	- pytorch
	- multi-label-classification
	datasets:
	- HlexNC/chest-xray-14-320
	---

	# CheXVision-ResNet

	## Project Resources

	- [GitHub repository](https://github.com/arudaev/chexvision)
	- [Presentation deck](https://arudaev.github.io/chexvision/)
	- [Live demo](https://huggingface.co/spaces/arudaev/chexvision-demo)
	- [Dataset](https://huggingface.co/datasets/arudaev/chest-xray-14-320)

	> CheXVision — Deep Learning & Big Data university project.
	> 14-class chest X-ray pathology detection + binary normal/abnormal classification
	> on the NIH Chest X-ray14 dataset (112,120 images).

	## Architecture

	<p align="center">
	<img src="https://huggingface.co/HlexNC/chexvision-scratch/resolve/main/arch_scratch.png" width="88%" alt="SE-ResNet Architecture"/>
	</p>

	## Training Pipeline

	<p align="center">
	<img src="https://huggingface.co/HlexNC/chexvision-scratch/resolve/main/pipeline_training.png" width="42%" alt="Training Pipeline"/>
	</p>

	## Training Metrics

	- Best validation macro AUC-ROC: `0.8008`
	- Best validation binary AUC-ROC: `0.7571`
	- Best validation binary F1: `0.6474`
	- Best checkpoint epoch: `60`


	## Training Configuration

	- Repository: `HlexNC/chexvision-scratch`
	- Dataset: [HlexNC/chest-xray-14-320](https://huggingface.co/datasets/HlexNC/chest-xray-14-320) · revision `44443e6ee968b3c6094b63f14a27698c40b50680`
	- Architecture: Custom residual CNN with Squeeze-Excitation channel attention (depth [3, 4, 6, 3]) trained from scratch with shared features and dual classification heads.
	- Platform: Kaggle GPU kernel (NVIDIA T4 / P100)
	- Batch size: `24` × grad_accum `4` = effective batch `96`
	- AMP (fp16): `enabled`
	- CLAHE preprocessing: `disabled`
	- Label smoothing: `0.0`
	- Optimizer: AdamW · Scheduler: CosineAnnealingLR
	- Epochs configured: `100` · Early stop patience: `15`

	## Intended Use

	This model is intended for research and educational work on automated chest X-ray pathology detection.
	It outputs two predictions per image:
	1. Multi-label scores — independent sigmoid probability for each of 14 NIH pathologies
	2. Binary score — sigmoid probability of any abnormality (Normal vs. Abnormal)

	## Limitations

	- Not validated for clinical use. Predictions must not substitute professional medical judgment.
	- Trained on NIH Chest X-ray14, which contains noisy radiologist annotations (patient-level labels, not lesion-level).
	- Performance degrades on images from equipment, patient populations, or preprocessing pipelines
	that differ from the NIH training distribution.
	- Reported AUC metrics are on the validation split, not the held-out test set.

	## CheXNet Benchmark Context

	CheXNet (Rajpurkar et al., 2017) — the seminal paper establishing DenseNet-121 for chest X-ray
	classification — reported 0.841 macro AUC-ROC on a comparable split of this dataset.
	CheXVision-DenseNet matches this benchmark. See the
	[CheXVision demo](https://huggingface.co/spaces/HlexNC/chexvision-demo) for live inference.

	## Citation

	```bibtex
	@misc{chexvision2026,
	title={CheXVision: Dual-Task Chest X-ray Classification with Custom CNN and DenseNet-121},
	author={BIG D(ATA) Team},
	year={2026},
	howpublished={\url{https://huggingface.co/HlexNC/chexvision-scratch}}
	}
	```