--- library_name: pytorch license: mit pipeline_tag: text-generation tags: - gdds - discrete-diffusion - language-modeling - research - pytorch --- # GDDS Checkpoints Official checkpoint bundle for the paper **Generalized Discrete Diffusion from Snapshots**. Generalized Discrete Diffusion from Snapshots (GDDS) is a unified framework for discrete diffusion modeling that supports arbitrary noising processes over large discrete state spaces. It introduces a training objective based on snapshot latents rather than the entire noising path, allowing for efficient training and high-quality generation. ## Model Sources - **Paper:** [Generalized Discrete Diffusion from Snapshots](https://huggingface.co/papers/2603.21342) - **arXiv:** [2603.21342](https://arxiv.org/abs/2603.21342) - **Code:** [GitHub - ozekri/gdds](https://github.com/ozekri/gdds) - **Project Page:** [https://oussamazekri.fr/gdds](https://oussamazekri.fr/gdds) ## Included Checkpoints | File | Method | Notes | | --- | --- | --- | | `checkpoints/gdds_gauss_500k.ckpt` | GDDS | 500k-step checkpoint with the Gaussian SIK forward process | | `checkpoints/gdds_uniform_500k.ckpt` | GDDS | 500k-step checkpoint with the uniform forward process | | `checkpoints/gdds_absorb_500k.ckpt` | GDDS | 500k-step checkpoint with the absorbing forward process | | `checkpoints/mdlm_500k.ckpt` | MDLM | 500k-step baseline checkpoint | | `checkpoints/udlm_500k.ckpt` | UDLM | 500k-step baseline checkpoint | | `checkpoints/ar_500k.ckpt` | AR | 500k-step autoregressive baseline checkpoint | ## Usage These files are PyTorch Lightning checkpoints intended to be used with the [`gdds`](https://github.com/ozekri/gdds) codebase. ```bash git clone https://github.com/ozekri/gdds.git cd gdds pip install -r requirements.txt pip install -e . # Example evaluation using a checkpoint PYTHONPATH=src python -m discrete_diffusion.evaluations.ppl_eval \ data=openwebtext \ model=small \ algo=mdlm \ eval.checkpoint_path=/path/to/checkpoints/mdlm_500k.ckpt ``` For sampling and other evaluations, use the same repository and pass the relevant checkpoint path through the Hydra evaluation config. ## Citation ```bibtex @misc{zekri2026generalizeddiscretediffusionsnapshots, title={Generalized Discrete Diffusion from Snapshots}, author={Oussama Zekri and Th{\\'e}o Uscidda and Nicolas Boull{\\'e} and Anna Korba}, year={2026}, eprint={2603.21342}, archivePrefix={arXiv}, primaryClass={stat.ML}, url={https://arxiv.org/abs/2603.21342}, } ```