Instructions to use zai-org/chatglm-6b-int8 with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use zai-org/chatglm-6b-int8 with Transformers:
# Load model directly from transformers import AutoModel model = AutoModel.from_pretrained("zai-org/chatglm-6b-int8", trust_remote_code=True, dtype="auto") - Notebooks
- Google Colab
- Kaggle
Commit History
Upload pytorch_model.bin 22906ae
Update slack link 79f4fdc
Update decode method in tokenizer f2182c9
duzx16 commited on
Add support for parallel quantization on Mac a697125
duzx16 commited on
Remove assert in load_cpu_kernel 3218e92
duzx16 commited on
Sync with chatglm-6b 216185d
duzx16 commited on
Remove pytorch_model.bin.index.json 2cc27c9
duzx16 commited on
Add pytorch_model.bin.index.json 9a41621
duzx16 commited on
Update slack link c7cc00a
duzx16 commited on
Upload ice_text.model 9076e37
Update README.md 8a66e07
duzx16 commited on
Init commit fb85b4d
duzx16 commited on
Upload pytorch_model.bin f8df870
initial commit f2d45d6
Zhengxiao Du commited on