Instructions to use ibm-ai-platform/llama3-8b-accelerator with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use ibm-ai-platform/llama3-8b-accelerator with Transformers:
# Load model directly from transformers import AutoModel model = AutoModel.from_pretrained("ibm-ai-platform/llama3-8b-accelerator", dtype="auto") - Notebooks
- Google Colab
- Kaggle
Unexpectedly Large Memory Usage of ibm-fms/llama3-8b-accelerator in vLLM
1
#4 opened over 1 year ago
by
baizhuoyan
llama3.1 version
1
#3 opened almost 2 years ago
by
amgadhasan
ValueError: Unsupported model type mlp_speculator using TGI server
2
#2 opened about 2 years ago
by
rishabh-simpplr
shard 0 never ready when given the speculator option?
7
#1 opened about 2 years ago
by
mhill4980