Update README.md

README.md

@@ -12,7 +12,7 @@ tags:
 ---
 # LLaMA3.1-8B-Instruct-DFlash-UltraChat
-[**Paper (Coming Soon)**](#) | [**GitHub**](https://github.com/z-lab/dflash) | [**Blog**](https://z-lab.ai/projects/dflash/)
+[**Paper**](https://arxiv.org/abs/2602.06036) | [**GitHub**](https://github.com/z-lab/dflash) | [**Blog**](https://z-lab.ai/projects/dflash/)
 **DFlash** is a novel speculative decoding method that utilizes a lightweight **block diffusion** model for drafting. It enables efficient, high-quality parallel drafting that pushes the limits of inference speed.