is there a FP8 version?

#2
by yejingfu - opened

The dtype should be set "bfloat16".
Would you train or quantize FP8 version of spec draft model?

yes. the dtype is "bfloat16", @jisenli can you change this. thanks.

Sign up or log in to comment