q / index.html

Commit History

keep-warm pulse: hold GPU boost clock while typing so casual chat decodes at boosted rate
70592b3
verified

Humuhumu33 commited on

warmup uses decode() to precompile batched pipelines (cut first-msg TTFT)
e4b6a8f
verified

Humuhumu33 commited on

fast first turn: warmup boosts clock + primes system-prompt KV (12s TTFT to ~0.5s) + static grounded greeting
7efaf4b
verified

Humuhumu33 commited on

discrete-GPU validation ?bench=discrete: bandwidth + live + spec-flip in one page
dabcb0d
verified

Humuhumu33 commited on

per-pass GPU trace ?bench=trace: name the non-weight overhead
770541b
verified

Humuhumu33 commited on

spec bench: warmup + short prompt + 192-tok decode-dominated (fix prefill confound)
439999f
verified

Humuhumu33 commited on

live decode profile ?bench=perf: boosted-clock steady tok/s vs roofline
e74320a
verified

Humuhumu33 commited on

spec-decode for BitNet: subNorm+f32-KV batched verify, ?spec + ?bench=spec
7406205
verified

Humuhumu33 commited on

Upload index.html with huggingface_hub
b50807f
verified

Humuhumu33 commited on

Upload index.html with huggingface_hub
1185393
verified

Humuhumu33 commited on

Upload index.html with huggingface_hub
abdc36e
verified

Humuhumu33 commited on

Upload index.html with huggingface_hub
cb2d355
verified

Humuhumu33 commited on

Upload index.html with huggingface_hub
7d0b228
verified

Humuhumu33 commited on

Upload folder using huggingface_hub
3365e13
verified

Humuhumu33 commited on

initial commit
3abea9c
verified

Humuhumu33 commited on