File size: 1,884 Bytes
b1f14a6
2ff743f
b1f14a6
2ff743f
 
 
 
 
 
 
b1f14a6
 
2ff743f
b1f14a6
2ff743f
 
b1f14a6
2ff743f
 
 
b1f14a6
2ff743f
b1f14a6
2ff743f
b1f14a6
2ff743f
 
b1f14a6
2ff743f
 
 
 
 
894ad96
2ff743f
b1f14a6
2ff743f
b1f14a6
2ff743f
b1f14a6
2ff743f
b1f14a6
2ff743f
 
b1f14a6
2ff743f
 
 
 
b1f14a6
2ff743f
b1f14a6
2ff743f
b1f14a6
2ff743f
b1f14a6
2ff743f
 
 
 
 
b1f14a6
2ff743f
 
b1f14a6
2ff743f
b1f14a6
2ff743f
b1f14a6
2ff743f
 
894ad96
2ff743f
b1f14a6
2ff743f
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
---
license: mit
library_name: diffusers
tags:
  - computed-tomography
  - ct-reconstruction
  - diffusion-model
  - inverse-problems
  - dm4ct
  - sparse-view-ct
---

# Pixel Diffusion UNet – LoDoInd (DM4CT)

This repository contains the pretrained **pixel-space diffusion UNet** used in the  
**DM4CT: Benchmarking Diffusion Models for CT Reconstruction (ICLR 2026)** benchmark.

πŸ”— Paper: https://openreview.net/forum?id=YE5scJekg5  
πŸ”— Arxiv: https://arxiv.org/abs/2602.18589  
πŸ”— Codebase: https://github.com/DM4CT/DM4CT  

---

## πŸ”¬ Model Overview

This model learns a **prior over CT reconstruction images** using a denoising diffusion probabilistic model (DDPM).  
It operates directly in **pixel space** (not latent space).

- **Architecture**: 2D UNet (Diffusers `UNet2DModel`)
- **Input resolution**: 512 Γ— 512
- **Channels**: 1 (grayscale CT slice)
- **Training objective**: Ξ΅-prediction (standard DDPM formulation)
- **Noise schedule**: Linear beta schedule
- **Training dataset**: Industry CT dataset (LoDoInd)
- **Intensity normalization**: Rescaled to (-1, 1)

This model is intended to be combined with data-consistency correction for CT reconstruction.

---

## πŸ“Š Dataset: LoDoInd

Source:
https://www.aapm.org/grandchallenge/lowdosect/

Preprocessing steps:
- Train/test split
- Rescale reconstructed slices to (-1, 1)
- No geometry information is embedded in the model

The model learns an unconditional image prior over CT slices.

---

## 🧠 Training Details

- Optimizer: AdamW
- Learning rate: 1e-4
- Batch size: (insert your batch size)
- Training steps: (insert number of steps)
- Hardware: NVIDIA A100 GPU

Training script:
https://github.com/DM4CT/DM4CT/blob/main/train_pixel.py

---

## πŸš€ Usage

```python
from diffusers import DDPMPipeline
pipeline = DDPMPipeline.from_pretrained("jiayangshi/lodoind_pixel_diffusion")
)

model.eval()