Please find GGUF Quants for the model here: QuantFactory/NuminiLlama-3.1-8B-GGUF
Thanks a lot! I'll need to update the model in the future but I'll let you know when it's ready
· Sign up or log in to comment