How to use TheBloke/30B-Epsilon-GGML with Transformers:
# Load model directly from transformers import AutoModel model = AutoModel.from_pretrained("TheBloke/30B-Epsilon-GGML", dtype="auto")