Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
Spaces:
atharv6f
/
flash-attention-explorer
like
0
Sleeping
App
Files
Files
Community
Fetching metadata from the HF Docker repository...
main
flash-attention-explorer
/
src
1 contributor
History:
28 commits
a0y0346
Fix Online Softmax: move subplot titles inside chart area (y=0.95)
7720e5f
about 1 month ago
__init__.py
Safe
43 Bytes
Phase 1: Core structure with model configs and placeholder tabs
about 1 month ago
attention_utils.py
Safe
13.6 kB
fix: Add fallback SDPA benchmark when attention layer fails
about 1 month ago
benchmark.py
Safe
37.3 kB
fix: Add fallback SDPA benchmark when attention layer fails
about 1 month ago
constants.py
Safe
2.85 kB
Add H200 GPU support and improve roofline chart visibility
about 1 month ago
gqa.py
Safe
21.9 kB
Fix GQA scaling chart x-axis: use log scale to space tick labels properly
about 1 month ago
memory_budget.py
Safe
19.7 kB
Add separate KV Cache dtype selector (FP16/BF16/FP8/INT8)
about 1 month ago
models.py
Safe
6.05 kB
Phase 1: Core structure with model configs and placeholder tabs
about 1 month ago
prefill_decode.py
Safe
35.6 kB
Refactor benchmarks to use real model.config values
about 1 month ago
visualizer.py
Safe
15.7 kB
Fix Online Softmax: move subplot titles inside chart area (y=0.95)
about 1 month ago