decode(): report live-path GPU ms/tok for clean CPU/GPU split d2afec3 verified Humuhumu33 commited on about 11 hours ago
per-pass GPU trace ?bench=trace: name the non-weight overhead 31a05bf verified Humuhumu33 commited on about 11 hours ago
spec-decode for BitNet: subNorm+f32-KV batched verify, ?spec + ?bench=spec d4f3433 verified Humuhumu33 commited on about 14 hours ago
spec-decode for BitNet: subNorm+f32-KV batched verify, ?spec + ?bench=spec 459725d verified Humuhumu33 commited on about 14 hours ago
Upload core/loader.js with huggingface_hub 484edea verified Humuhumu33 commited on about 23 hours ago