divelab/OPDLM-8B

Text Generation

diffusion-language-model

on-policy-distillation


shubhamprshr committed about 18 hours ago

Commit 1aa9ccf · verified · 1 Parent(s): af0d73b

Update README.md

Files changed (1)

README.md +4 -5


@@ -12,13 +12,13 @@ pipeline_tag: text-generation
 base_model: Qwen/Qwen3-8B
 datasets:
 - divelab/opdlm_train_data
 ---
 # OPDLM-8B
 OPDLM-8B is a block diffusion language model (DLM) obtained by post-training an
 autoregressive language model (ARLM) into a diffusion language model via
-**on-policy distillation**. Arxiv Report- arxiv.org/abs/2606.06712
 ## Highlights
 - **Converted, not pretrained from scratch:** built from a strong ARLM, reusing its prior.
@@ -40,7 +40,6 @@ autoregressive language model (ARLM) into a diffusion language model via
 - **Data:** [opdlm_train_data](https://huggingface.co/datasets/divelab/opdlm_train_data)
 ## Evaluation
 | Benchmark   | Score |
 |-------------|-------|
 | MMLU        | 70.9  |
@@ -65,6 +64,6 @@ Decoding: static (one token per step)
       eprint={2606.06712},
       archivePrefix={arXiv},
       primaryClass={cs.CL},
-      url={https://arxiv.org/abs/2606.06712},
 ```

 base_model: Qwen/Qwen3-8B
 datasets:
 - divelab/opdlm_train_data
+arxiv: 2606.06712
 ---
 # OPDLM-8B
 OPDLM-8B is a block diffusion language model (DLM) obtained by post-training an
 autoregressive language model (ARLM) into a diffusion language model via
+**on-policy distillation**. arXiv report: [arxiv.org/abs/2606.06712](https://arxiv.org/abs/2606.06712)
 ## Highlights
 - **Converted, not pretrained from scratch:** built from a strong ARLM, reusing its prior.
 - **Data:** [opdlm_train_data](https://huggingface.co/datasets/divelab/opdlm_train_data)
 ## Evaluation
 | Benchmark   | Score |
 |-------------|-------|
 | MMLU        | 70.9  |
       eprint={2606.06712},
       archivePrefix={arXiv},
       primaryClass={cs.CL},
+      url={https://arxiv.org/abs/2606.06712},
+}
 ```