Run this model on multiple 24GB cards?
#9
by LemonBranny - opened
It seems that there is not enough video memory to run this model on a single 24GB card. Can I run this model on multiple 24GB cards to reduce the video memory usage of each card?
Any suggestions are greatly appreciated.
same
Same here. I tried quantizing it to 8-bit with no success (weird errors), and 4-bit will be unusable.
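One thing that might be worth trying is splitting the layers across cards with an explicit device map. Below is a minimal sketch of how such a map could be built, not a tested recipe for this model. The module names (`vit`, `vision_proj`, `model.tok_embeddings`, `model.layers.N`, `model.norm`, `output`) and the layer count of 32 are assumptions and should be checked against the names printed by `model.named_modules()`.

```python
def build_device_map(num_layers, num_gpus):
    """Spread decoder layers evenly over GPUs 0..num_gpus-1.

    All module names here are ASSUMED for illustration; verify them
    against the real checkpoint before use.
    """
    if num_gpus < 1 or num_layers < num_gpus:
        raise ValueError("need at least one GPU and one layer per GPU")

    device_map = {
        # Vision tower and token embeddings go on the first card (assumed names).
        "vit": 0,
        "vision_proj": 0,
        "model.tok_embeddings": 0,
        # Final norm and output head go on the last card (assumed names).
        "model.norm": num_gpus - 1,
        "output": num_gpus - 1,
    }
    for i in range(num_layers):
        # Contiguous, near-equal chunks: layer i lands on GPU floor(i * G / L).
        device_map[f"model.layers.{i}"] = i * num_gpus // num_layers
    return device_map


# Hypothetical setup: 32 decoder layers split over two 24GB cards.
device_map = build_device_map(num_layers=32, num_gpus=2)
layers_per_gpu = {}
for name, gpu in device_map.items():
    if name.startswith("model.layers."):
        layers_per_gpu[gpu] = layers_per_gpu.get(gpu, 0) + 1
print(layers_per_gpu)
```

The resulting dict could then be passed as `device_map=` to `from_pretrained` (this needs `accelerate` installed). Whether this model's remote code works when sharded this way is something I have not verified. Because the first card also holds the vision modules, it may need fewer decoder layers than the others to stay under 24GB.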