How to use zai-org/chatglm-6b-int8 with Transformers:
# Load model directly from transformers import AutoModel model = AutoModel.from_pretrained("zai-org/chatglm-6b-int8", trust_remote_code=True, dtype="auto")