Update README.md
README.md
CHANGED
@@ -12,7 +12,7 @@ tags:
 ---
 
 # LLaMA3.1-8B-Instruct-DFlash-UltraChat
-[**Paper
+[**Paper**](https://arxiv.org/abs/2602.06036) | [**GitHub**](https://github.com/z-lab/dflash) | [**Blog**](https://z-lab.ai/projects/dflash/)
 
 **DFlash** is a novel speculative decoding method that utilizes a lightweight **block diffusion** model for drafting. It enables efficient, high-quality parallel drafting that pushes the limits of inference speed.
 