Could you provide INT4 quantized version

#2
by Fouye - opened

FP8 444GB is still too large.

INT4 wll also help with KTransformers

Sign up or log in to comment