Broken

by Aryanne - opened Jul 17, 2023

Jul 17, 2023

open_llama_3b-q4_0-ggjt.bin seems to be broken, doesn't run on koboldcpp, it gives an error about dimensionality of some weight somewhere.

LLukas22

rustformers org Jul 17, 2023

koboldcpp is based on llama.cpp which has hardcoded sizes for the different llama architectures. Meaning 3B isn't in there yet. You could contribute it to koboldcpp or use rustformers/llm which calculates model sizes dynamically.

Aryanne

Jul 17, 2023

ok, I was running https://huggingface.co/SlyEcho/open_llama_3b_ggml and it was working fine

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment