jsttlgdkycy/NI_Sampling

Add model card and link to paper

by nielsr HF Staff - opened 28 days ago

base: refs/heads/main ← from: refs/pr/1

README.md +39 -3

@@ -1,3 +1,39 @@
----
-license: cc-by-4.0
----

+---
+license: cc-by-4.0
+pipeline_tag: text-classification
+---
+# NI Sampling: Accelerating Discrete Diffusion Sampling by Token Order Optimization
+This repository contains the trained indicator for **Neural Indicator Sampling (NI Sampling)**, a framework designed to accelerate the sampling process of discrete diffusion Large Language Models (dLLMs).
+NI Sampling uses a neural indicator to decide which tokens to sample at each step. By exploiting predictions that are already correct, it reduces the number of sampling iterations by an order of magnitude. Experiments on models such as LLaDA and Dream show up to 14.3x acceleration with a negligible performance drop.
+- **Paper:** [NI Sampling: Accelerating Discrete Diffusion Sampling by Token Order Optimization](https://huggingface.co/papers/2604.18471)
+- **Code:** [GitHub Repository](https://github.com/imagination-research/NI-Sampling)
+## Overview
+NI Sampling trains a lightweight neural indicator that predicts, at each sampling step, which tokens should be sampled. This removes redundant computation from dLLM sampling while maintaining generation quality.
+## Evaluation
+To evaluate the indicator on benchmarks using the official implementation, you can use commands like the following:
+```bash
+# GSM8K
+accelerate launch --main_process_port 11450 --num_processes 1 eval_llada.py --tasks gsm8k --model llada_dist --model_args model_path='GSAI-ML/LLaDA-8B-Instruct',gen_length=256,steps=256,block_length=64,prob_threshold=0.95,indicator_path="indicator_LLaDA.pth",indicator_threshold=0.89,use_indicator=True
+# HumanEval
+accelerate launch --main_process_port 11450 --num_processes 1 eval_llada.py --tasks humaneval --model llada_dist --confirm_run_unsafe_code --model_args model_path='GSAI-ML/LLaDA-8B-Instruct',gen_length=256,steps=256,block_length=32,prob_threshold=0.95,indicator_path="indicator_LLaDA.pth",indicator_threshold=0.89,use_indicator=True
+```
+## Citation
+```bibtex
+@inproceedings{liuni,
+  title={NI Sampling: Accelerating Discrete Diffusion Sampling by Token Order Optimization},
+  author={Liu, Enshu and Ning, Xuefei and Wang, Yu and Lin, Zinan},
+  booktitle={The Fourteenth International Conference on Learning Representations}
+}
+```
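
As a reading aid for the Overview text in this diff, the per-step selection can be sketched roughly as below. This is a minimal illustration, not the repository's implementation. The function name `select_positions`, the list-based inputs, and the fallback to the single highest-scoring position are all assumptions. Only the 0.89 default is taken from the `indicator_threshold` argument in the evaluation commands.

```python
from typing import List


def select_positions(
    scores: List[float],
    masked: List[bool],
    threshold: float = 0.89,  # value of indicator_threshold in the README commands
) -> List[int]:
    """Pick which still-masked positions to sample at this step.

    scores[i] is a hypothetical indicator score for position i;
    masked[i] is True while position i has not been sampled yet.
    """
    candidates = [i for i, is_masked in enumerate(masked) if is_masked]
    if not candidates:
        return []

    # Accept every masked position whose score clears the threshold.
    chosen = [i for i in candidates if scores[i] >= threshold]

    # Assumed fallback: if nothing clears the threshold, sample the single
    # best-scoring position so that every step makes progress.
    if not chosen:
        chosen = [max(candidates, key=lambda i: scores[i])]
    return chosen


if __name__ == "__main__":
    # Position 2 scores highly but is already sampled, so it is skipped.
    print(select_positions([0.97, 0.40, 0.91, 0.10], [True, True, False, True]))
    # No score clears the threshold, so only the best candidate is returned.
    print(select_positions([0.20, 0.50, 0.30], [True, True, True]))
```

Under these assumptions, sampling several positions in one step is what lowers the total step count: a step that accepts many positions replaces many single-token steps.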