DICE-4B: Diffusion Large Language Models Excel at Generating CUDA Kernels

DICE is a series of diffusion large language models (dLLMs) designed for CUDA kernel generation, spanning three parameter scales, 1.7B, 4B, and 8B.

Citation

@article{bai2026dice,
  title={DICE: Diffusion Large Language Models Excel at Generating CUDA Kernels},
  author={Bai, Haolei and Kong, Lingcheng and Chen, Xueyi and Wang, Jianmian and Tao, Zhiqiang and Wang, Huan},
  journal={arXiv preprint arXiv:2602.11715},
  year={2026}
}
Downloads last month
26
Safetensors
Model size
4B params
Tensor type
BF16
·
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Collection including DeadlyKitt3n/DICE-4B

Paper for DeadlyKitt3n/DICE-4B