d3LLM
/

d3LLM_LLaDA

+---
+datasets:
+- d3LLM-Data-LLaDA/trajectory_data_llada_32
+tags:
+- diffusion
+- text-generation
+- fast-inference
+- d3llm
+pipeline_tag: text-generation
+---
+# d3LLM: Ultra-Fast Diffusion LLM using Pseudo-Trajectory Distillation 🚀
+## Model Description
+**d3LLM-LLaDA** is an ultra-fast diffusion language model that achieves high generation speed while maintaining competitive performance. Built on the Dream architecture.
+## Key Features
+- 🚀 **4.9× faster** than autoregressive models (Qwen-2.5-7B-it) on H100 GPU
+- 🎯 **3.5× faster** on A100 GPU
+- ⚡ **280.97 tokens/s** on H100 (vs 57.32 for AR baseline)
+- 📊 High AUP (Accuracy Under Parallelism) scores across benchmarks
+- 🔧 Optimized for coding and math reasoning tasks
+## Usage
+For detailed usage instructions, evaluation scripts, and training code, please refer to the official GitHub repository:
+👉 **[https://github.com/hao-ai-lab/text-diffusion](https://github.com/hao-ai-lab/text-diffusion)**