Add runtime_config.json with optimal spec decode settings 39e7656 verified darkmaniac7 commited on 12 days ago
Update README with honest 3x averaged benchmark numbers 2d9453c verified darkmaniac7 commited on 12 days ago
Add config_opencl.json for OpenCL draft backend support 5531dd9 verified darkmaniac7 commited on 12 days ago
v3: KL-distilled 0.6B from Qwen3-8B (10K samples, KL=0.339, +41% uplift on SM8850) 1f8c190 verified darkmaniac7 commited on 13 days ago
docs: update README — abliterated draft with benchmarks 51a33fe verified darkmaniac7 commited on 16 days ago
feat: switch to abliterated draft (Huihui-Qwen3-0.6B-abliterated-v2) for better acceptance af3c044 verified darkmaniac7 commited on 16 days ago
feat: switch to abliterated draft (Huihui-Qwen3-0.6B-abliterated-v2) for better acceptance baa57ff verified darkmaniac7 commited on 16 days ago
feat: switch to abliterated draft (Huihui-Qwen3-0.6B-abliterated-v2) for better acceptance b9d2cac verified darkmaniac7 commited on 16 days ago
feat: switch to abliterated draft (Huihui-Qwen3-0.6B-abliterated-v2) for better acceptance 470043d verified darkmaniac7 commited on 16 days ago
feat: switch to abliterated draft (Huihui-Qwen3-0.6B-abliterated-v2) for better acceptance 33d29cb verified darkmaniac7 commited on 16 days ago