Request - int8 version
#23 by erosdiffusion - opened
Would it be possible to have the stack in int8? It seems 30xx GPUs (e.g. the 3080) can benefit from that (smaller size, faster inference).
erosdiffusion changed discussion title from "Request - int8 fersion" to "Request - int8 version"