How to use senseable/mlx_dolphin-2.6-phi-2_4-bit with Transformers:
# Load model directly
from transformers import AutoModel

model = AutoModel.from_pretrained("senseable/mlx_dolphin-2.6-phi-2_4-bit", dtype="auto")
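Once the model loads, a typical next step is text generation. The following is a minimal sketch, not part of the original snippet. It assumes that this repository ships a tokenizer and weights that Transformers can read as a causal language model, and that the installed Transformers version accepts the `dtype` argument used above. The function name `generate_reply` and the `max_new_tokens` default are hypothetical and chosen for illustration only.

```python
MODEL_ID = "senseable/mlx_dolphin-2.6-phi-2_4-bit"


def generate_reply(prompt, model_id=MODEL_ID, max_new_tokens=64):
    """Hypothetical helper: load the model and generate a continuation of `prompt`."""
    # Imported lazily so that defining the helper does not require transformers.
    from transformers import AutoModelForCausalLM, AutoTokenizer

    # Assumption: the repo provides a tokenizer and causal-LM weights.
    tokenizer = AutoTokenizer.from_pretrained(model_id)
    model = AutoModelForCausalLM.from_pretrained(model_id, dtype="auto")

    # Tokenize the prompt into PyTorch tensors and generate new tokens.
    inputs = tokenizer(prompt, return_tensors="pt")
    output_ids = model.generate(**inputs, max_new_tokens=max_new_tokens)

    # Decode the first (and only) sequence back into text.
    return tokenizer.decode(output_ids[0], skip_special_tokens=True)
```

Calling `generate_reply("Hello")` downloads the model on first use, so it needs network access and enough memory for the weights.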