File size: 447 Bytes
036695c
7776776
 
036695c
7776776
036695c
 
 
 
7776776
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
---
title: LLM Inference Pipeline Simulator
emoji: πŸš€
colorFrom: gray
colorTo: blue
sdk: static
pinned: false
---

# LLM Inference β€” Pipeline Simulator

Interactive visualization of the LLM inference pipeline (prefill, decode, KV cache). From EXD Episode 3 β€” Inference Benchmarking.

Built with vanilla HTML/CSS/JS. Zero dependencies.

πŸ“Ί [Watch the episode](https://youtube.com/@EXD-ai) | πŸ’» [GitHub](https://github.com/Ramshreyas/EXD)