Instructions to use microsoft/bloom-deepspeed-inference-fp16 with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use microsoft/bloom-deepspeed-inference-fp16 with Transformers:
# Use a pipeline as a high-level helper from transformers import pipeline pipe = pipeline("feature-extraction", model="microsoft/bloom-deepspeed-inference-fp16")# Load model directly from transformers import AutoTokenizer, AutoModel tokenizer = AutoTokenizer.from_pretrained("microsoft/bloom-deepspeed-inference-fp16") model = AutoModel.from_pretrained("microsoft/bloom-deepspeed-inference-fp16") - Notebooks
- Google Colab
- Kaggle
rename non-tp
Browse files
BLOOM-non-tp.pt → non-tp.pt
RENAMED
|
File without changes
|