File size: 3,436 Bytes
d2253eb | 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 83 84 85 86 87 88 89 90 91 92 93 94 95 96 97 98 99 100 101 102 103 104 105 106 107 108 | # Prism: Efficient Test-Time Scaling via Hierarchical Search and Self-Verification for Discrete Diffusion Language Models
## PRISM: Pruning, Remasking, and Integrated Self-verification Method
PRISM is an efficient inference framework designed for **Discrete Diffusion Language Models (dLLMs)**, focusing on a favorable performance-efficiency trade-off by matching Best-of-N performance with substantially fewer Function Evaluations (NFE).
[](https://arxiv.org/abs/2602.01842)
[](https://github.com/viiika/Prism)
### Method

### Experiments

### Project Structure
```text
PRISM/
βββ Dream/ # Experiments for Dream
β βββ Dream_Baseline/ # Standard baseline sampling (N=1)
β βββ Dream_Prism/ # Prism implementation
βββ LLaDA/ # Experiments for LLaDA 8B Instruct
β βββ LLaDA_Baseline/ # Standard baseline sampling (N=1)
β βββ LLaDA_Prism/ # PRISM implementation
β βββ LLaDA_Truthfulqa/ # TruthfulQA evaluation
βββ LLaDA2mini/ # Experiments for LLaDA 2.0-mini
βββ LLaDA2mini_Baseline/ # Standard baseline sampling (N=1)
βββ LLaDA2mini_Prism/ # Prism implementation
```
### Prerequisites
```bash
cd PRISM
```
For Dream Project:
```bash
cd Dream/Dream_Prism/eval_instruct
pip install -e .
```
For LLaDA_Truthfulqa:
```bash
cd LLaDA/LLaDA_Truthfulqa/lm-evaluation-harness
pip install -e .
```
For LLaDA and LLaDA2 Projects:
```bash
cd LLaDA/LLaDA_Prism
pip install -r requirements.txt
```
#### Quick Start
Evaluate Dream
```bash
cd Dream/Dream_Prism
bash scripts/run_gsm8k.sh
bash scripts/run_humaneval.sh
bash scripts/run_math500.sh
bash scripts/run_mbpp.sh
```
Evaluate LLaDA 8B Instruct
```bash
cd LLaDA/LLaDA_Prism
bash scripts/run_gsm8k.sh
bash scripts/run_humaneval.sh
bash scripts/run_math500.sh
bash scripts/run_mbpp.sh
```
Evaluate LLaDA 8B Instruct(Truthfulqa)
```bash
cd LLaDA/LLaDA_Truthfulqa
bash scripts/llada_prism.sh
```
Evaluate LLaDA 2.0-mini
```bash
cd LLaDA2mini/LLaDA2mini_Prism
bash scripts/run_gsm8k.sh
bash scripts/run_humaneval.sh
bash scripts/run_math500.sh
bash scripts/run_mbpp.sh
```
### Evaluation & Metrics
Each project folder contains a metrics/ directory used for calculating final accuracy and efficiency metrics.
Usage Example:
```bash
python PRISM/LLaDA/LLaDA_Prism/metrics/gsm8k_all.py
```
### Acknowledgements
This project is built upon [preordinary/LLaDA2](https://github.com/preordinary/LLaDA2), [ML-GSAI/LLaDA](https://github.com/ML-GSAI/LLaDA), [DreamLM/Dream](https://github.com/DreamLM/Dream) and [EleutherAI/lm-evaluation-harness](https://github.com/EleutherAI/lm-evaluation-harness). Special thanks to the authors for their contributions.
### π Citation
If you find this work helpful, please consider citing:
```bibtex
@article{bai2026prism,
title={Prism: Efficient Test-Time Scaling via Hierarchical Search and Self-Verification for Discrete Diffusion Language Models},
author={Bai, Jinbin and Li, Yixuan and Zhu, Yuchen and Xin, Yi and Shi, Qingyu and Feng, Aosong and Liu, Xiaohong and Tao, Molei and Xue, Jianru and Li, Xiangtai and Yang, Ming-Hsuan},
journal={arXiv preprint arXiv:2602.01842},
year={2026}
}
```
|