decode(): report live-path GPU ms/tok for clean CPU/GPU split 8f34f10 verified Humuhumu33 committed about 9 hours ago
spec-decode for BitNet: subNorm+f32-KV batched verify, ?spec + ?bench=spec 57800b6 verified Humuhumu33 committed about 11 hours ago