Running via apple MLX

by paulmaksimovich - opened Dec 19, 2023

•

I'm interested in running this using the MLX framework, but intelligently converting weights is beyond my understanding currently.

Does anyone have any guidance on getting this working, or steps to understand how to go about converting something like this to work with mlx?

I kind of get the whole mx.array thing, but how do you reason about converting weights?

With the conversion script being - https://github.com/ml-explore/mlx-examples/blob/10a7b99e835b87b9f6d762fcb4de47b6f300f52e/bert/convert.py#L22
Seems mostly to be replacing a few keys, but what's the rationale? Is that just the tensor -> npz format stuff? Are there any other considerations?

Thanks!

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment