Instructions to use vcadillo/glm-4v-9b-4-bits with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use vcadillo/glm-4v-9b-4-bits with Transformers:
# Use a pipeline as a high-level helper from transformers import pipeline pipe = pipeline("feature-extraction", model="vcadillo/glm-4v-9b-4-bits", trust_remote_code=True)# Load model directly from transformers import AutoModel model = AutoModel.from_pretrained("vcadillo/glm-4v-9b-4-bits", trust_remote_code=True, dtype="auto") - Notebooks
- Google Colab
- Kaggle
这个是用什么版本的transformers?{在线急}
1
#3 opened over 1 year ago
by
jackleef
4bit 是否能用cudnn进行推理加速
#2 opened almost 2 years ago
by
baiall