---
license: mit
language:
- en
tags:
- image-classification
- medical
- cervical-cancer
- pytorch
- cnn
- colposcopy
datasets:
- custom
metrics:
- accuracy
- f1
pipeline_tag: image-classification
library_name: pytorch
---

# Cervical Cancer Classification CNN

A CNN model for classifying cervical colposcopy images into 4 severity classes for cervical cancer screening.

## Model Description

This model classifies cervical images into:

| Class | Label | Description | Clinical Action |
|-------|-------|-------------|-----------------|
| 0 | Normal | Healthy cervical tissue | Routine screening in 3-5 years |
| 1 | LSIL | Low-grade Squamous Intraepithelial Lesion | Monitor, repeat test in 6-12 months |
| 2 | HSIL | High-grade Squamous Intraepithelial Lesion | Colposcopy, biopsy, treatment required |
| 3 | Cancer | Invasive cervical cancer | Immediate oncology referral |

---

## Model Architecture

### Architecture Diagram

```
┌─────────────────────────────────────────────────────────────┐
│                         INPUT IMAGE                         │
│                       (3 × 224 × 298)                       │
└─────────────────────────┬───────────────────────────────────┘
                          │
┌─────────────────────────▼───────────────────────────────────┐
│                        CONV BLOCK 1                         │
│  ├── Conv2d(3 → 32, kernel=3×3, padding=1)                  │
│  ├── BatchNorm2d(32)                                        │
│  ├── ReLU                                                   │
│  └── MaxPool2d(2×2)                                         │
│  Output: 32 × 112 × 149                                     │
└─────────────────────────┬───────────────────────────────────┘
                          │
┌─────────────────────────▼───────────────────────────────────┐
│                        CONV BLOCK 2                         │
│  ├── Conv2d(32 → 64, kernel=3×3, padding=1)                 │
│  ├── BatchNorm2d(64)                                        │
│  ├── ReLU                                                   │
│  └── MaxPool2d(2×2)                                         │
│  Output: 64 × 56 × 74                                       │
└─────────────────────────┬───────────────────────────────────┘
                          │
┌─────────────────────────▼───────────────────────────────────┐
│                        CONV BLOCK 3                         │
│  ├── Conv2d(64 → 128, kernel=3×3, padding=1)                │
│  ├── BatchNorm2d(128)                                       │
│  ├── ReLU                                                   │
│  └── MaxPool2d(2×2)                                         │
│  Output: 128 × 28 × 37                                      │
└─────────────────────────┬───────────────────────────────────┘
                          │
┌─────────────────────────▼───────────────────────────────────┐
│                        CONV BLOCK 4                         │
│  ├── Conv2d(128 → 256, kernel=3×3, padding=1)               │
│  ├── BatchNorm2d(256)                                       │
│  ├── ReLU                                                   │
│  └── MaxPool2d(2×2)                                         │
│  Output: 256 × 14 × 18                                      │
└─────────────────────────┬───────────────────────────────────┘
                          │
┌─────────────────────────▼───────────────────────────────────┐
│                   GLOBAL AVERAGE POOLING                    │
│  └── AdaptiveAvgPool2d(1×1)                                 │
│  Output: 256 × 1 × 1 → Flatten → 256                        │
└─────────────────────────┬───────────────────────────────────┘
                          │
┌─────────────────────────▼───────────────────────────────────┐
│                         FC BLOCK 1                          │
│  ├── Linear(256 → 256)                                      │
│  ├── ReLU                                                   │
│  └── Dropout(0.5)                                           │
└─────────────────────────┬───────────────────────────────────┘
                          │
┌─────────────────────────▼───────────────────────────────────┐
│                         FC BLOCK 2                          │
│  ├── Linear(256 → 128)                                      │
│  ├── ReLU                                                   │
│  └── Dropout(0.5)                                           │
└─────────────────────────┬───────────────────────────────────┘
                          │
┌─────────────────────────▼───────────────────────────────────┐
│                         CLASSIFIER                          │
│  └── Linear(128 → 4)                                        │
│  Output: 4 class logits                                     │
└─────────────────────────┬───────────────────────────────────┘
                          │
                          ▼
             [Normal, LSIL, HSIL, Cancer]
```

### Architecture Summary Table

| Layer | Type | Input Shape | Output Shape | Parameters |
|-------|------|-------------|--------------|------------|
| conv_layers.0 | Conv2d | (3, 224, 298) | (32, 224, 298) | 896 |
| conv_layers.1 | BatchNorm2d | (32, 224, 298) | (32, 224, 298) | 64 |
| conv_layers.2 | ReLU | - | - | 0 |
| conv_layers.3 | MaxPool2d | (32, 224, 298) | (32, 112, 149) | 0 |
| conv_layers.4 | Conv2d | (32, 112, 149) | (64, 112, 149) | 18,496 |
| conv_layers.5 | BatchNorm2d | (64, 112, 149) | (64, 112, 149) | 128 |
| conv_layers.6 | ReLU | - | - | 0 |
| conv_layers.7 | MaxPool2d | (64, 112, 149) | (64, 56, 74) | 0 |
| conv_layers.8 | Conv2d | (64, 56, 74) | (128, 56, 74) | 73,856 |
| conv_layers.9 | BatchNorm2d | (128, 56, 74) | (128, 56, 74) | 256 |
| conv_layers.10 | ReLU | - | - | 0 |
| conv_layers.11 | MaxPool2d | (128, 56, 74) | (128, 28, 37) | 0 |
| conv_layers.12 | Conv2d | (128, 28, 37) | (256, 28, 37) | 295,168 |
| conv_layers.13 | BatchNorm2d | (256, 28, 37) | (256, 28, 37) | 512 |
| conv_layers.14 | ReLU | - | - | 0 |
| conv_layers.15 | MaxPool2d | (256, 28, 37) | (256, 14, 18) | 0 |
| avgpool | AdaptiveAvgPool2d | (256, 14, 18) | (256, 1, 1) | 0 |
| fc_layers.0 | Linear | 256 | 256 | 65,792 |
| fc_layers.1 | ReLU | - | - | 0 |
| fc_layers.2 | Dropout | - | - | 0 |
| fc_layers.3 | Linear | 256 | 128 | 32,896 |
| fc_layers.4 | ReLU | - | - | 0 |
| fc_layers.5 | Dropout | - | - | 0 |
| classifier | Linear | 128 | 4 | 516 |
| **Total** | | | | **488,580** |

### PyTorch Model Code

```python
import torch
import torch.nn as nn


class CervicalCancerCNN(nn.Module):
    def __init__(self):
        super().__init__()
        # Convolutional layers: [32, 64, 128, 256]
        self.conv_layers = nn.Sequential(
            # Block 1: 3 -> 32
            nn.Conv2d(3, 32, kernel_size=3, padding=1),
            nn.BatchNorm2d(32),
            nn.ReLU(inplace=True),
            nn.MaxPool2d(2, 2),
            # Block 2: 32 -> 64
            nn.Conv2d(32, 64, kernel_size=3, padding=1),
            nn.BatchNorm2d(64),
            nn.ReLU(inplace=True),
            nn.MaxPool2d(2, 2),
            # Block 3: 64 -> 128
            nn.Conv2d(64, 128, kernel_size=3, padding=1),
            nn.BatchNorm2d(128),
            nn.ReLU(inplace=True),
            nn.MaxPool2d(2, 2),
            # Block 4: 128 -> 256
            nn.Conv2d(128, 256, kernel_size=3, padding=1),
            nn.BatchNorm2d(256),
            nn.ReLU(inplace=True),
            nn.MaxPool2d(2, 2),
        )
        self.avgpool = nn.AdaptiveAvgPool2d(1)
        # Fully connected layers: [256, 128] -> 4
        self.fc_layers = nn.Sequential(
            nn.Linear(256, 256),
            nn.ReLU(inplace=True),
            nn.Dropout(0.5),
            nn.Linear(256, 128),
            nn.ReLU(inplace=True),
            nn.Dropout(0.5),
        )
        self.classifier = nn.Linear(128, 4)

    def forward(self, x):
        x = self.conv_layers(x)
        x = self.avgpool(x)
        x = x.view(x.size(0), -1)
        x = self.fc_layers(x)
        x = self.classifier(x)
        return x
```

---

## Performance

### Overall Metrics

| Metric | Value |
|--------|-------|
| **Accuracy** | 59.52% |
| **Macro F1** | 59.85% |
| **Parameters** | 488,580 |
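The parameter count reported above (488,580) can be verified with quick arithmetic. The sketch below re-derives it layer by layer; it assumes standard PyTorch parameter accounting (conv and linear layers carry weights plus biases, BatchNorm contributes only its affine scale and shift, since the running statistics are buffers rather than parameters).

```python
# Re-derive the model's parameter count from the layer sizes above.
def conv2d_params(cin, cout, k=3):
    return cout * cin * k * k + cout        # kernel weights + bias

def bn_params(c):
    return 2 * c                            # affine gamma + beta

def linear_params(fin, fout):
    return fout * fin + fout                # weight matrix + bias

total = 0
cin = 3
for cout in [32, 64, 128, 256]:             # the four conv blocks
    total += conv2d_params(cin, cout) + bn_params(cout)
    cin = cout
# FC head: 256 -> 256 -> 128 -> 4 (ReLU/Dropout/MaxPool add no parameters)
total += linear_params(256, 256) + linear_params(256, 128) + linear_params(128, 4)

print(total)  # 488580
```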
### Per-Class Metrics

| Class | Precision | Recall | F1 Score | Support |
|-------|-----------|--------|----------|---------|
| Normal | 0.595 | 0.595 | 0.595 | 84 |
| LSIL | 0.521 | 0.583 | 0.551 | 84 |
| HSIL | 0.446 | 0.440 | 0.443 | 84 |
| Cancer | 0.853 | 0.762 | 0.805 | 84 |

### Confusion Matrix

```
Predicted →   Normal   LSIL   HSIL   Cancer
Actual ↓
Normal            50      9     17        8
LSIL              24     49     11        0
HSIL               9     35     37        3
Cancer             1      1     18       64
```

---

## Usage

### Installation

```bash
pip install torch torchvision safetensors huggingface_hub
```

### Loading the Model

```python
import json

import torch
from safetensors.torch import load_file
from huggingface_hub import hf_hub_download

# Download model files
model_file = hf_hub_download("toderian/cerviguard_lesion", "model.safetensors")
config_file = hf_hub_download("toderian/cerviguard_lesion", "config.json")

# Load config
with open(config_file) as f:
    config = json.load(f)

# Define model (copy from above or download modeling_cervical.py)
model = CervicalCancerCNN()

# Load weights
state_dict = load_file(model_file)
model.load_state_dict(state_dict)
model.eval()
```

### Inference

```python
from PIL import Image
import torchvision.transforms as T

# Preprocessing
transform = T.Compose([
    T.Resize((224, 298)),
    T.ToTensor(),
    T.Normalize(mean=[0.485, 0.456, 0.406],
                std=[0.229, 0.224, 0.225]),
])

# Load and preprocess image
image = Image.open("cervical_image.jpg").convert("RGB")
input_tensor = transform(image).unsqueeze(0)

# Inference
with torch.no_grad():
    output = model(input_tensor)
    probabilities = torch.softmax(output, dim=1)
    prediction = output.argmax(dim=1).item()

classes = ["Normal", "LSIL", "HSIL", "Cancer"]
print(f"Prediction: {classes[prediction]}")
print(f"Confidence: {probabilities[0][prediction]:.2%}")
```

---

## Training Details

| Parameter | Value |
|-----------|-------|
| Learning Rate | 1e-4 |
| Batch Size | 32 |
| Optimizer | Adam |
| Loss | CrossEntropyLoss |
| Dropout | 0.5 |
| Epochs | 34 (early stopping at 24) |
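For orientation, the hyperparameters in the training table translate into a single optimizer step roughly as sketched below. This is not the project's training script: the tiny stand-in classifier and the random batch are placeholders so the snippet runs self-contained; in practice the `CervicalCancerCNN` defined earlier, real images at 3 × 224 × 298, and a full epoch loop with early stopping take their place.

```python
import torch
import torch.nn as nn

# Stand-in model so the sketch is self-contained (replace with CervicalCancerCNN).
model = nn.Sequential(nn.Flatten(), nn.Linear(3 * 16 * 16, 4))

# Hyperparameters from the table above: Adam, lr=1e-4, CrossEntropyLoss, batch 32.
optimizer = torch.optim.Adam(model.parameters(), lr=1e-4)
criterion = nn.CrossEntropyLoss()

images = torch.randn(32, 3, 16, 16)   # one synthetic batch of images
labels = torch.randint(0, 4, (32,))   # class indices 0..3

# One training step: forward, loss, backward, parameter update.
model.train()
optimizer.zero_grad()
loss = criterion(model(images), labels)
loss.backward()
optimizer.step()
print(loss.item())
```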
### Dataset

| Split | Samples | Distribution |
|-------|---------|--------------|
| Train | 3,003 | Imbalanced [1540, 469, 854, 140] |
| Test | 336 | Balanced [84, 84, 84, 84] |

---

## Limitations

- Trained on a limited dataset (~3k samples)
- The HSIL class has the lowest performance (F1 = 0.443)
- Should not be used as a sole diagnostic tool
- Intended for research and screening assistance only

## Medical Disclaimer

⚠️ **This model is for research purposes only.** It should not be used as a substitute for professional medical diagnosis. Always consult qualified healthcare professionals for cervical cancer screening and diagnosis.

---

## Files in This Repository

| File | Description |
|------|-------------|
| `model.safetensors` | Model weights (safetensors format) |
| `pytorch_model.bin` | Model weights (legacy PyTorch format) |
| `config.json` | Model configuration |
| `preprocessor_config.json` | Image preprocessing settings |
| `modeling_cervical.py` | Model class definition |
| `example_inference.py` | Example inference script |

---

## Citation

```bibtex
@misc{cervical-cancer-cnn-2025,
  author = {Toderian},
  title = {Cervical Cancer Classification CNN},
  year = {2025},
  publisher = {Hugging Face},
  url = {https://huggingface.co/toderian/cerviguard_lesion}
}
```
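As a consistency check on the Performance section, the accuracy, per-class precision/recall/F1, and macro F1 can all be recomputed from the confusion matrix (rows are actual classes, columns are predictions):

```python
# Confusion matrix from the Performance section (rows = actual, cols = predicted).
cm = [
    [50,  9, 17,  8],   # Normal
    [24, 49, 11,  0],   # LSIL
    [ 9, 35, 37,  3],   # HSIL
    [ 1,  1, 18, 64],   # Cancer
]
classes = ["Normal", "LSIL", "HSIL", "Cancer"]
n = len(cm)

total = sum(sum(row) for row in cm)
correct = sum(cm[i][i] for i in range(n))
print(f"Accuracy: {correct / total:.4f}")  # 0.5952

f1s = []
for i, name in enumerate(classes):
    tp = cm[i][i]
    precision = tp / sum(cm[r][i] for r in range(n))  # column sum = predicted i
    recall = tp / sum(cm[i])                          # row sum = actual i
    f1 = 2 * precision * recall / (precision + recall)
    f1s.append(f1)
    print(f"{name}: P={precision:.3f} R={recall:.3f} F1={f1:.3f}")

print(f"Macro F1: {sum(f1s) / n:.4f}")  # 0.5985
```

The recomputed values match the tables above (accuracy 59.52%, macro F1 59.85%).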