Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

Spaces:
atharv6f
/
flash-attention-explorer
Sleeping

App Files Files Community
Fetching metadata from the HF Docker repository...
flash-attention-explorer / src
  • 1 contributor
History: 28 commits
a0y0346
Fix Online Softmax: move subplot titles inside chart area (y=0.95)
7720e5f about 1 month ago
  • __init__.py
    43 Bytes
    Phase 1: Core structure with model configs and placeholder tabs about 1 month ago
  • attention_utils.py
    13.6 kB
    fix: Add fallback SDPA benchmark when attention layer fails about 1 month ago
  • benchmark.py
    37.3 kB
    fix: Add fallback SDPA benchmark when attention layer fails about 1 month ago
  • constants.py
    2.85 kB
    Add H200 GPU support and improve roofline chart visibility about 1 month ago
  • gqa.py
    21.9 kB
    Fix GQA scaling chart x-axis: use log scale to space tick labels properly about 1 month ago
  • memory_budget.py
    19.7 kB
    Add separate KV Cache dtype selector (FP16/BF16/FP8/INT8) about 1 month ago
  • models.py
    6.05 kB
    Phase 1: Core structure with model configs and placeholder tabs about 1 month ago
  • prefill_decode.py
    35.6 kB
    Refactor benchmarks to use real model.config values about 1 month ago
  • visualizer.py
    15.7 kB
    Fix Online Softmax: move subplot titles inside chart area (y=0.95) about 1 month ago