Spaces:
Sleeping
Sleeping
| title: GPUguesstimator | |
| emoji: ๐ | |
| colorFrom: pink | |
| colorTo: red | |
| sdk: gradio | |
| sdk_version: 6.1.0 | |
| app_file: app.py | |
| pinned: false | |
| license: apache-2.0 | |
| # LLM GPU Sizer (Gradio) | |
| This Space estimates: | |
| - VRAM for model weights + KV cache (worst-case per concurrency) | |
| - number of GPUs required (with headroom) | |
| - TTFT and ITL (anchor-based simulation) | |
| - optionally reads TTFT/ITL from a running vLLM server `/metrics` | |
| ## Local dev (uv) | |
| ```bash | |
| uv venv | |
| uv pip install -r requirements.txt | |
| uv run python app.py | |