File size: 508 Bytes
3d49912
 
 
 
 
 
 
 
 
 
 
 
3846650
 
 
 
 
 
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
---
title: GPUguesstimator
emoji: 🌍
colorFrom: pink
colorTo: red
sdk: gradio
sdk_version: 6.1.0
app_file: app.py
pinned: false
license: apache-2.0
---

# LLM GPU Sizer (Gradio)

This Space estimates:
- VRAM for model weights + KV cache (worst-case per concurrency)
- number of GPUs required (with headroom)
- TTFT and ITL (anchor-based simulation)
- optionally reads TTFT/ITL from a running vLLM server `/metrics`

## Local dev (uv)
```bash
uv venv
uv pip install -r requirements.txt
uv run python app.py