Model checkpoints accompanying the paper "Scaling Behavior of Discrete Diffusion Language Models" (https://arxiv.org/abs/2512.10858).
Dimitri von Rütte
dvruette
Scaling Behavior of Discrete Diffusion Language Models
Generalized Interpolating Discrete Diffusion
OpenWebText BPE
BPE tokenizers with vocabulary sizes between 1k and 131k, trained on OpenWebText, along with the pre-tokenized dataset for each tokenizer.