Initialize model card
Browse files
README.md
CHANGED
|
@@ -1,3 +1,45 @@
|
|
| 1 |
---
|
| 2 |
-
license:
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 3 |
---
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
---
|
| 2 |
+
license: other
|
| 3 |
+
library_name: pytorch
|
| 4 |
+
pipeline_tag: text-generation
|
| 5 |
+
tags:
|
| 6 |
+
- discrete-diffusion
|
| 7 |
+
- diffusion-language-model
|
| 8 |
+
- self-correction
|
| 9 |
+
- scdd
|
| 10 |
+
- icml-2026
|
| 11 |
+
datasets:
|
| 12 |
+
- openwebtext
|
| 13 |
---
|
| 14 |
+
|
| 15 |
+
# SCDD
|
| 16 |
+
|
| 17 |
+
This repository contains the released checkpoints for **Generalized Discrete Diffusion with Self-Correction**.
|
| 18 |
+
|
| 19 |
+
SCDD is a self-correcting discrete diffusion language model. It learns to revise incorrect visible tokens directly during generation, preserving parallel decoding without a remasking step.
|
| 20 |
+
|
| 21 |
+
## Checkpoints
|
| 22 |
+
|
| 23 |
+
| File | Model | Uniform noise ratio |
|
| 24 |
+
| --- | --- | --- |
|
| 25 |
+
| `checkpoints/scdd_pu_0.1.ckpt` | SCDD (0.1) | `p_u = 0.1` |
|
| 26 |
+
| `checkpoints/scdd_pu_0.2.ckpt` | SCDD (0.2) | `p_u = 0.2` |
|
| 27 |
+
|
| 28 |
+
The checkpoint filenames intentionally use `scdd` naming for the public release.
|
| 29 |
+
|
| 30 |
+
## Code
|
| 31 |
+
|
| 32 |
+
Code and evaluation scripts are available at:
|
| 33 |
+
|
| 34 |
+
<https://github.com/laaaarrywang/Self-Correcting-Discrete-Diffusion>
|
| 35 |
+
|
| 36 |
+
## Citation
|
| 37 |
+
|
| 38 |
+
```bibtex
|
| 39 |
+
@article{wang2026generalized,
|
| 40 |
+
title={Generalized Discrete Diffusion with Self-Correction},
|
| 41 |
+
author={Wang, Linxuan and Wang, Ziyi and Bai, Yikun and Deng, Wei and Lin, Guang and Song, Qifan},
|
| 42 |
+
journal={arXiv preprint arXiv:2603.02230},
|
| 43 |
+
year={2026}
|
| 44 |
+
}
|
| 45 |
+
```
|