--- title: LLM Inference Pipeline Simulator emoji: 🚀 colorFrom: gray colorTo: blue sdk: static pinned: false --- # LLM Inference — Pipeline Simulator Interactive visualization of the LLM inference pipeline (prefill, decode, KV cache). From EXD Episode 3 — Inference Benchmarking. Built with vanilla HTML/CSS/JS. Zero dependencies. 📺 [Watch the episode](https://youtube.com/@EXD-ai) | 💻 [GitHub](https://github.com/Ramshreyas/EXD)