Aurora_GGUF_Final : GGUF

This model was finetuned and converted to GGUF format using Unsloth.

Example usage:

For text only LLMs: ./llama.cpp/llama-cli -hf kiritzzo1612/Aurora_GGUF_Final --jinja
For multimodal models: ./llama.cpp/llama-mtmd-cli -hf kiritzzo1612/Aurora_GGUF_Final --jinja

Available Model files:

An Ollama Modelfile is included for easy deployment.

The model's BOS token behavior was adjusted for GGUF compatibility. This was trained 2x faster with Unsloth

GGUF

Model size

9B params

Architecture

gemma2

Hardware compatibility

4-bit

Inference Providers NEW

This model isn't deployed by any Inference Provider. 🙋 Ask for provider support