Request - int8 version

#23
by erosdiffusion - opened

Would it be possible to have the stack at int8. It seems 30xx (eg 3080) can benefit from that (lower size, faster inference)

erosdiffusion changed discussion title from Request - int8 fersion to Request - int8 version

Sign up or log in to comment