gdds / README.md
Xssama's picture
Update README.md
f866388 verified
metadata
library_name: pytorch
license: mit
pipeline_tag: text-generation
tags:
  - gdds
  - discrete-diffusion
  - language-modeling
  - research
  - pytorch

GDDS Checkpoints

Official checkpoint bundle for the paper Generalized Discrete Diffusion from Snapshots.

Generalized Discrete Diffusion from Snapshots (GDDS) is a unified framework for discrete diffusion modeling that supports arbitrary noising processes over large discrete state spaces. It introduces a training objective based on snapshot latents rather than the entire noising path, allowing for efficient training and high-quality generation.

Model Sources

Included Checkpoints

File Method Notes
checkpoints/gdds_gauss_500k.ckpt GDDS 500k-step checkpoint with the Gaussian SIK forward process
checkpoints/gdds_uniform_500k.ckpt GDDS 500k-step checkpoint with the uniform forward process
checkpoints/gdds_absorb_500k.ckpt GDDS 500k-step checkpoint with the absorbing forward process
checkpoints/mdlm_500k.ckpt MDLM 500k-step baseline checkpoint
checkpoints/udlm_500k.ckpt UDLM 500k-step baseline checkpoint
checkpoints/ar_500k.ckpt AR 500k-step autoregressive baseline checkpoint

Usage

These files are PyTorch Lightning checkpoints intended to be used with the gdds codebase.

git clone https://github.com/ozekri/gdds.git
cd gdds
pip install -r requirements.txt
pip install -e .

# Example evaluation using a checkpoint
PYTHONPATH=src python -m discrete_diffusion.evaluations.ppl_eval \
  data=openwebtext \
  model=small \
  algo=mdlm \
  eval.checkpoint_path=/path/to/checkpoints/mdlm_500k.ckpt

For sampling and other evaluations, use the same repository and pass the relevant checkpoint path through the Hydra evaluation config.

Citation

@misc{zekri2026generalizeddiscretediffusionsnapshots,
  title={Generalized Discrete Diffusion from Snapshots},
  author={Oussama Zekri and Th{\\'e}o Uscidda and Nicolas Boull{\\'e} and Anna Korba},
  year={2026},
  eprint={2603.21342},
  archivePrefix={arXiv},
  primaryClass={stat.ML},
  url={https://arxiv.org/abs/2603.21342},
}