How to use barnybug/stack-llama-2-ggml with Transformers:
# Load model directly
from transformers import AutoModel

model = AutoModel.from_pretrained("barnybug/stack-llama-2-ggml", dtype="auto")
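The call above can fail at load time, for example when a repository does not provide a config or weight files in a format Transformers can read. The following is a minimal sketch of a defensive wrapper, not part of the original snippet. The names `REPO_ID`, `load_model`, and the `loader` argument are hypothetical helpers introduced for illustration; only `AutoModel.from_pretrained` comes from the library, and which exception is raised for this particular repository is an assumption.

```python
from typing import Any, Callable, Optional

REPO_ID = "barnybug/stack-llama-2-ggml"


def load_model(repo_id: str = REPO_ID,
               loader: Optional[Callable[..., Any]] = None) -> Optional[Any]:
    """Try to load `repo_id`; return the model, or None if loading fails.

    `loader` is a hypothetical hook for this sketch (not a Transformers
    parameter); it defaults to AutoModel.from_pretrained.
    """
    if loader is None:
        # Imported lazily so this module can be imported without transformers.
        from transformers import AutoModel
        loader = AutoModel.from_pretrained
    try:
        return loader(repo_id, dtype="auto")
    except (OSError, ValueError) as err:
        # Assumption: missing or unreadable config/weight files surface as
        # OSError, and unrecognized configurations as ValueError.
        print(f"Could not load {repo_id}: {err}")
        return None
```

Calling `load_model()` with no arguments attempts the same load as the snippet above and returns `None` instead of raising if the repository cannot be read.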