shubhamprshr commited on
Commit
1aa9ccf
·
verified ·
1 Parent(s): af0d73b

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +4 -5
README.md CHANGED
@@ -12,13 +12,13 @@ pipeline_tag: text-generation
12
  base_model: Qwen/Qwen3-8B
13
  datasets:
14
  - divelab/opdlm_train_data
 
15
  ---
16
-
17
  # OPDLM-8B
18
 
19
  OPDLM-8B is a block diffusion language model (DLM) obtained by post-training an
20
  autoregressive language model (ARLM) into a diffusion language model via
21
- **on-policy distillation**. Arxiv Report- arxiv.org/abs/2606.06712
22
 
23
  ## Highlights
24
  - **Converted, not pretrained from scratch:** built from a strong ARLM, reusing its prior.
@@ -40,7 +40,6 @@ autoregressive language model (ARLM) into a diffusion language model via
40
  - **Data:** [opdlm_train_data](https://huggingface.co/datasets/divelab/opdlm_train_data)
41
 
42
  ## Evaluation
43
-
44
  | Benchmark | Score |
45
  |-------------|-------|
46
  | MMLU | 70.9 |
@@ -65,6 +64,6 @@ Decoding: static (one token per step)
65
  eprint={2606.06712},
66
  archivePrefix={arXiv},
67
  primaryClass={cs.CL},
68
- url={https://arxiv.org/abs/2606.06712},
69
-
70
  ```
 
12
  base_model: Qwen/Qwen3-8B
13
  datasets:
14
  - divelab/opdlm_train_data
15
+ arxiv: 2606.06712
16
  ---
 
17
  # OPDLM-8B
18
 
19
  OPDLM-8B is a block diffusion language model (DLM) obtained by post-training an
20
  autoregressive language model (ARLM) into a diffusion language model via
21
+ **on-policy distillation**. arXiv report: [arxiv.org/abs/2606.06712](https://arxiv.org/abs/2606.06712)
22
 
23
  ## Highlights
24
  - **Converted, not pretrained from scratch:** built from a strong ARLM, reusing its prior.
 
40
  - **Data:** [opdlm_train_data](https://huggingface.co/datasets/divelab/opdlm_train_data)
41
 
42
  ## Evaluation
 
43
  | Benchmark | Score |
44
  |-------------|-------|
45
  | MMLU | 70.9 |
 
64
  eprint={2606.06712},
65
  archivePrefix={arXiv},
66
  primaryClass={cs.CL},
67
+ url={https://arxiv.org/abs/2606.06712},
68
+ }
69
  ```