Commit History

add opd-32b-v33-s150-gptq-w4a16 + dflash phaseL int4mlp-gptq draft
39037e6
verified

ycchen committed on

add dflash-32b-draft-v2test-phaseL-int4mlp (int4-MLP draft)
f7dd3b2
verified

ycchen committed on

add opd-32b-v33-s200-gptq-w4a16 (sink-on + long-ctx + fp8 KV scale)
39aa530
verified

ycchen committed on

card: add dflash-32b-draft-v2test-phaseL (phase-2 long-ctx final, job 140680); mark phase-1 as warm-up
ef8af61
verified

ycchen committed on

card: fix opd-32b-deploy provenance (v33/job135076/step_200, not the V32 158-collapse) + add opd-32b-v33-s150
37c0b61
verified

ycchen committed on

index: add opd-32b-deploy + dflash-32b-draft-v2test
52b5609
verified

ycchen committed on

Add bundle README
3596238
verified

ycchen committed on