decode(): report live-path GPU ms/tok for clean CPU/GPU split d2afec3 verified Humuhumu33 commited on about 14 hours ago
spec-decode for BitNet: subNorm+f32-KV batched verify, ?spec + ?bench=spec d4f3433 verified Humuhumu33 commited on about 17 hours ago