Spaces:
Sleeping
Sleeping
File size: 508 Bytes
3d49912 3846650 | 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 | ---
title: GPUguesstimator
emoji: 🌍
colorFrom: pink
colorTo: red
sdk: gradio
sdk_version: 6.1.0
app_file: app.py
pinned: false
license: apache-2.0
---
# LLM GPU Sizer (Gradio)
This Space estimates:
- VRAM for model weights + KV cache (worst-case per concurrency)
- number of GPUs required (with headroom)
- TTFT and ITL (anchor-based simulation)
- optionally reads TTFT/ITL from a running vLLM server `/metrics`
## Local dev (uv)
```bash
uv venv
uv pip install -r requirements.txt
uv run python app.py
|