Edit-Based Refinement for Parallel Masked Diffusion Language Models

Introduction

ME-DLM is a lightweight edit-based refinement framework for masked diffusion language models. It first generates a complete response through parallel diffusion decoding, then refines the output with minimal edit operations such as replacement, deletion, and insertion, conditioned on the full sequence. By using edit distance as deterministic training supervision, ME-DLM improves sequence-level consistency while preserving the decoding efficiency of diffusion models. Built on LLaDA, it achieves consistent gains on HumanEval and GSM8K while using only one-eighth of the total diffusion steps.

Models

Model	Checkpoint
ME-DLM Stage 1	🤗 HF Link
ME-DLM Stage 2	🤗 HF Link
ME-DLM Stage 3	🤗 HF Link

Acknowledgments

We thank the following amazing projects that truly inspired us:

LLaDA

Downloads last month: -

Safetensors

Model size

9B params

Tensor type

BF16

Model tree for renhouxing/ME-DLM-Stage1

Base model

GSAI-ML/LLaDA-8B-Base

Finetuned

(7)

this model

Paper for renhouxing/ME-DLM-Stage1

Edit-Based Refinement for Parallel Masked Diffusion Language Models

Paper • 2605.09603 • Published 3 days ago