Instructions to use izhx/udever-bloom-7b1 with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use izhx/udever-bloom-7b1 with Transformers:
# Use a pipeline as a high-level helper from transformers import pipeline pipe = pipeline("feature-extraction", model="izhx/udever-bloom-7b1")# Load model directly from transformers import AutoTokenizer, AutoModel tokenizer = AutoTokenizer.from_pretrained("izhx/udever-bloom-7b1") model = AutoModel.from_pretrained("izhx/udever-bloom-7b1") - Notebooks
- Google Colab
- Kaggle
Running via apple MLX
I'm interested in running this using the MLX framework, but intelligently converting weights is beyond my understanding currently.
Does anyone have any guidance on getting this working, or steps to understand how to go about converting something like this to work with mlx?
I kind of get the whole mx.array thing, but how do you reason about converting weights?
There is a Bert example - https://github.com/ml-explore/mlx-examples/tree/main/bert
With the conversion script being - https://github.com/ml-explore/mlx-examples/blob/10a7b99e835b87b9f6d762fcb4de47b6f300f52e/bert/convert.py#L22
Seems mostly to be replacing a few keys, but what's the rationale? Is that just the tensor -> npz format stuff? Are there any other considerations?
Thanks!