Instructions to use microsoft/bloom-deepspeed-inference-fp16 with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use microsoft/bloom-deepspeed-inference-fp16 with Transformers:
    # Use a pipeline as a high-level helper
    from transformers import pipeline

    pipe = pipeline("feature-extraction", model="microsoft/bloom-deepspeed-inference-fp16")

    # Load model directly
    from transformers import AutoTokenizer, AutoModel

    tokenizer = AutoTokenizer.from_pretrained("microsoft/bloom-deepspeed-inference-fp16")
    model = AutoModel.from_pretrained("microsoft/bloom-deepspeed-inference-fp16")
- Notebooks
- Google Colab
- Kaggle
How to split tensors to x shards?
#1 opened almost 3 years ago by Ede-CH
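As a rough illustration of what the discussion title above asks about, the sketch below splits a weight matrix into x contiguous, near-equal shards along its first dimension. It is a generic pure-Python example, not the procedure used to produce this repository's checkpoint and not a DeepSpeed API. The helper name `split_into_shards` and the toy `weight` matrix are hypothetical, introduced only for illustration.

```python
from typing import List, Sequence, TypeVar

T = TypeVar("T")


def split_into_shards(rows: Sequence[T], num_shards: int) -> List[List[T]]:
    """Split `rows` into `num_shards` contiguous shards of near-equal size.

    Hypothetical helper: when the length is not divisible by `num_shards`,
    the first `len(rows) % num_shards` shards each get one extra row.
    """
    if num_shards < 1:
        raise ValueError("num_shards must be at least 1")
    base, extra = divmod(len(rows), num_shards)
    shards: List[List[T]] = []
    start = 0
    for i in range(num_shards):
        size = base + (1 if i < extra else 0)
        shards.append(list(rows[start:start + size]))
        start += size
    return shards


# Toy stand-in for a weight tensor: 10 rows of 4 values each (assumed shape).
weight = [[r * 4 + c for c in range(4)] for r in range(10)]

# Split along the first dimension into x = 4 shards.
shards = split_into_shards(weight, 4)
shard_sizes = [len(s) for s in shards]
print(shard_sizes)
```

Concatenating the shards in order gives back the original rows, so the split loses nothing. With a real tensor library the same idea is usually expressed with a built-in chunking function instead of a hand-written loop, and which dimension to split depends on the layer, which this sketch does not address.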