Commit History

Fix Online Softmax: move subplot titles inside chart area (y=0.95)
7720e5f

a0y0346 commited on

Fix Online Softmax charts: increase height (350px), margins, and y-axis range for labels
73045e3

a0y0346 commited on

Fix Memory Hierarchy: move labels further right (x=0.65), shorten arrow
83079d4

a0y0346 commited on

Fix Memory Hierarchy diagram: move transfer labels to right of arrow
1307f62

a0y0346 commited on

Fix GQA scaling chart x-axis: use log scale to space tick labels properly
287497c

a0y0346 commited on

Fix GQA tab chart overlaps
783749d

a0y0346 commited on

Add separate KV Cache dtype selector (FP16/BF16/FP8/INT8)
509d7b6

a0y0346 commited on

Add .gitignore, remove pycache from tracking
6492c04

a0y0346 commited on

Fix legend overlap in context scaling chart, increase batch size limit to 256
3de9ded

a0y0346 commited on

Fix text overlap in Memory Budget charts
aabd982

a0y0346 commited on

Add Memory Budget tab with real model and GPU detection
80187ae

a0y0346 commited on

Add GQA/MQA tab with real model configuration
a9b1e87

a0y0346 commited on

Refactor benchmarks to use real model.config values
af9b854

a0y0346 commited on

Fix NoneType comparison error in prefill/decode charts
c30936f

a0y0346 commited on

fix: Add fallback SDPA benchmark when attention layer fails
685194e

a0y0346 commited on

feat: Use real HuggingFace model attention layers for benchmarks
374d38b

a0y0346 commited on

Use scatter markers for attention pattern visualization
47751f7

a0y0346 commited on

Fix attention pattern heatmaps with standard colorscales
fb9f765

a0y0346 commited on

Fix Prefill vs Decode chart visibility issues
30d461e

a0y0346 commited on

Implement Prefill vs Decode tab (Phase 4)
473fcf2

a0y0346 commited on

Use compute capability + SM count for better GPU spec estimation
5642188

a0y0346 commited on

Add H200 GPU support and improve roofline chart visibility
e2de6cd

a0y0346 commited on

Auto-detect GPU instead of dropdown selection
d5ef20e

a0y0346 commited on

Make Roofline chart dynamic with measured benchmark results
3fec038

a0y0346 commited on

Add Phase 3: Benchmark tab with GPU performance measurement
05810bb

a0y0346 commited on

Fix Visualizer tab bugs: softmax data, reset button, causal toggle
ac3157f

a0y0346 commited on

Phase 2: Add Visualizer tab with tiling animation, online softmax, and memory hierarchy
c9bdf44

a0y0346 commited on

Phase 1: Core structure with model configs and placeholder tabs
341bde8

a0y0346 commited on