Apertus-70B-Instruct-2509

#1354
by spiritdude - opened

This model is unfortunately gated and so is currently not possible for us to access. Because of this I also have no idea if the architecture on which it is based on is even supported by llama.cpp as I can't access the files required to determine that. If you have access you could use https://huggingface.co/spaces/huggingface-projects/repo_duplicator to create a non-gated version of it. I already asked a friend to request access to it when he wakes up but no idea if he will get accepted and how long this process will take.

Our friend just got access to this model. Unfortunately the architecture is ApertusForCausalLM which is not currently supported by llama.cpp but likely to be supported in the near to medium future: Please follow: https://github.com/ggml-org/llama.cpp/issues/15748

Sign up or log in to comment