fp8 fmt for hopper ep deploy?
#98
by whybeyoung - opened
we wan to deploy k2.5 with p/d disaggregation deploy , which decode deploy as ep16 on hopper series... do we have fp8 fmt ?
we wan to deploy k2.5 with p/d disaggregation deploy , which decode deploy as ep16 on hopper series... do we have fp8 fmt ?