nielsr HF Staff commited on
Commit
2ef2b8d
Β·
verified Β·
1 Parent(s): 7774130

Add pipeline tag and improve model card

Browse files

Hi, I'm Niels from the Hugging Face community team.

This PR improves the model card for the LoDoChallenge pixel diffusion model:
- Adds the `image-to-image` pipeline tag to the metadata.
- Fixes the usage snippet to correctly showcase how to use the model with the `diffusers` library.
- Adds links to the official project page, codebase, and research paper.
- Adds a BibTeX citation for the DM4CT paper.

Files changed (1) hide show
  1. README.md +46 -31
README.md CHANGED
@@ -1,30 +1,29 @@
1
  ---
2
- license: mit
3
  library_name: diffusers
 
 
4
  tags:
5
- - computed-tomography
6
- - ct-reconstruction
7
- - diffusion-model
8
- - inverse-problems
9
- - dm4ct
10
- - sparse-view-ct
11
  ---
12
 
13
  # Pixel Diffusion UNet – LoDoChallenge (DM4CT)
14
 
15
- This repository contains the pretrained **pixel-space diffusion UNet** used in the
16
- **DM4CT: Benchmarking Diffusion Models for CT Reconstruction (ICLR 2026)** benchmark.
17
 
18
- πŸ”— Paper: https://openreview.net/forum?id=YE5scJekg5
19
- πŸ”— Arxiv: https://arxiv.org/abs/2602.18589
20
- πŸ”— Codebase: https://github.com/DM4CT/DM4CT
21
 
22
  ---
23
 
24
  ## πŸ”¬ Model Overview
25
 
26
- This model learns a **prior over CT reconstruction images** using a denoising diffusion probabilistic model (DDPM).
27
- It operates directly in **pixel space** (not latent space).
28
 
29
  - **Architecture**: 2D UNet (Diffusers `UNet2DModel`)
30
  - **Input resolution**: 512 Γ— 512
@@ -34,42 +33,58 @@ It operates directly in **pixel space** (not latent space).
34
  - **Training dataset**: Low Dose Grand Challenge (LoDoChallenge)
35
  - **Intensity normalization**: Rescaled to (-1, 1)
36
 
37
- This model is intended to be combined with data-consistency correction for CT reconstruction.
38
 
39
  ---
40
 
41
  ## πŸ“Š Dataset: Low Dose Grand Challenge
42
 
43
- Source:
44
- https://www.aapm.org/grandchallenge/lowdosect/
45
 
46
  Preprocessing steps:
47
- - Train/test split
48
- - Rescale reconstructed slices to (-1, 1)
49
- - No geometry information is embedded in the model
50
 
51
- The model learns an unconditional image prior over CT slices.
52
 
53
  ---
54
 
55
  ## 🧠 Training Details
56
 
57
- - Optimizer: AdamW
58
- - Learning rate: 1e-4
59
- - Batch size: (insert your batch size)
60
- - Training steps: (insert number of steps)
61
- - Hardware: NVIDIA A100 GPU
62
-
63
- Training script:
64
- https://github.com/DM4CT/DM4CT/blob/main/train_pixel.py
65
 
66
  ---
67
 
68
  ## πŸš€ Usage
69
 
 
 
70
  ```python
71
  from diffusers import DDPMPipeline
 
 
72
  pipeline = DDPMPipeline.from_pretrained("jiayangshi/lodochallenge_pixel_diffusion")
73
- )
74
 
75
- model.eval()
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
  ---
 
2
  library_name: diffusers
3
+ license: mit
4
+ pipeline_tag: image-to-image
5
  tags:
6
+ - computed-tomography
7
+ - ct-reconstruction
8
+ - diffusion-model
9
+ - inverse-problems
10
+ - dm4ct
11
+ - sparse-view-ct
12
  ---
13
 
14
  # Pixel Diffusion UNet – LoDoChallenge (DM4CT)
15
 
16
+ This repository contains the pretrained **pixel-space diffusion UNet** presented in the paper [DM4CT: Benchmarking Diffusion Models for Computed Tomography Reconstruction](https://huggingface.co/papers/2602.18589) (ICLR 2026).
 
17
 
18
+ - **Project Page:** [https://dm4ct.github.io/DM4CT/](https://dm4ct.github.io/DM4CT/)
19
+ - **Codebase:** [https://github.com/DM4CT/DM4CT](https://github.com/DM4CT/DM4CT)
20
+ - **Paper:** [DM4CT: Benchmarking Diffusion Models for Computed Tomography Reconstruction](https://huggingface.co/papers/2602.18589)
21
 
22
  ---
23
 
24
  ## πŸ”¬ Model Overview
25
 
26
+ This model learns a **prior over CT reconstruction images** using a denoising diffusion probabilistic model (DDPM). It operates directly in **pixel space** (not latent space).
 
27
 
28
  - **Architecture**: 2D UNet (Diffusers `UNet2DModel`)
29
  - **Input resolution**: 512 Γ— 512
 
33
  - **Training dataset**: Low Dose Grand Challenge (LoDoChallenge)
34
  - **Intensity normalization**: Rescaled to (-1, 1)
35
 
36
+ This model is intended to be combined with data-consistency correction for CT reconstruction tasks as detailed in the DM4CT benchmark.
37
 
38
  ---
39
 
40
  ## πŸ“Š Dataset: Low Dose Grand Challenge
41
 
42
+ Source: [AAPM Low Dose CT Grand Challenge](https://www.aapm.org/grandchallenge/lowdosect/)
 
43
 
44
  Preprocessing steps:
45
+ - Train/test split.
46
+ - Rescale reconstructed slices to (-1, 1).
47
+ - No geometry information is embedded in the model.
48
 
49
+ The model learns an unconditional image prior over medical CT slices.
50
 
51
  ---
52
 
53
  ## 🧠 Training Details
54
 
55
+ - **Optimizer**: AdamW
56
+ - **Learning rate**: 1e-4
57
+ - **Hardware**: NVIDIA A100 GPU
58
+ - **Training script**: [train_pixel.py](https://github.com/DM4CT/DM4CT/blob/main/train_pixel.py)
 
 
 
 
59
 
60
  ---
61
 
62
  ## πŸš€ Usage
63
 
64
+ You can use this model with the `diffusers` library:
65
+
66
  ```python
67
  from diffusers import DDPMPipeline
68
+
69
+ # Load the pretrained pipeline
70
  pipeline = DDPMPipeline.from_pretrained("jiayangshi/lodochallenge_pixel_diffusion")
 
71
 
72
+ # Generate a sample (unconditional CT slice prior)
73
+ image = pipeline().images[0]
74
+ image.save("generated_ct_slice.png")
75
+ ```
76
+
77
+ ---
78
+
79
+ ## Citation
80
+
81
+ ```bibtex
82
+ @inproceedings{
83
+ shi2026dmct,
84
+ title={{DM}4{CT}: Benchmarking Diffusion Models for Computed Tomography Reconstruction},
85
+ author={Shi, Jiayang and Pelt, Dani{\"e}l M and Batenburg, K Joost},
86
+ booktitle={The Fourteenth International Conference on Learning Representations},
87
+ year={2026},
88
+ url={https://openreview.net/forum?id=YE5scJekg5}
89
+ }
90
+ ```