metadata
tags:
- gguf
- llama.cpp
- unsloth
gemma3-1b-query-optimizer : GGUF
This model was finetuned and converted to GGUF format using Unsloth.
Example usage:
- For text-only LLMs:
./llama.cpp/llama-cli -hf VijayRam1812/gemma3-1b-query-optimizer --jinja
- For multimodal models:
./llama.cpp/llama-mtmd-cli -hf VijayRam1812/gemma3-1b-query-optimizer --jinja
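If you would rather launch the text-only command from a script, the following is a minimal Python sketch. It only assembles the same arguments shown above; the helper name `build_llama_cli_command` and the optional `prompt` parameter are illustrative assumptions, not part of this repository. It assumes `llama.cpp` has been built at `./llama.cpp/`.

```python
import shlex
import subprocess

# Repository id taken from the usage example above.
REPO_ID = "VijayRam1812/gemma3-1b-query-optimizer"


def build_llama_cli_command(binary="./llama.cpp/llama-cli", repo_id=REPO_ID, prompt=None):
    """Return the argument list for the text-only llama-cli invocation.

    Hypothetical helper: it mirrors the documented command and optionally
    appends an initial prompt via llama-cli's -p flag.
    """
    cmd = [binary, "-hf", repo_id, "--jinja"]
    if prompt is not None:
        cmd += ["-p", prompt]
    return cmd


def run_llama_cli(cmd):
    """Run the command; raises CalledProcessError on a non-zero exit code."""
    return subprocess.run(cmd, check=True)


if __name__ == "__main__":
    command = build_llama_cli_command()
    # Print the shell-equivalent command instead of running it by default.
    print(shlex.join(command))
    # run_llama_cli(command)  # uncomment once llama.cpp is built locally
```

Passing the arguments as a list avoids shell quoting problems if a prompt contains spaces or quotes.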
Available model files:
gemma-3-1b-it.Q4_K_M.gguf
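After downloading the file, a quick sanity check can confirm that it is a GGUF file and not a truncated or mislabeled download. GGUF files begin with the 4-byte magic `GGUF` followed by a 32-bit version number. The sketch below reads only that header; the function name `read_gguf_version` is hypothetical, and it assumes the usual little-endian layout.

```python
import struct


def read_gguf_version(path):
    """Return the GGUF format version stored in the file header.

    Hypothetical helper: checks the 4-byte magic b"GGUF" and then decodes
    the following 4 bytes as a little-endian unsigned 32-bit integer.
    """
    with open(path, "rb") as f:
        header = f.read(8)
    if len(header) < 8 or header[:4] != b"GGUF":
        raise ValueError(f"{path} does not look like a GGUF file")
    (version,) = struct.unpack("<I", header[4:8])
    return version
```

For example, `read_gguf_version("gemma-3-1b-it.Q4_K_M.gguf")` would return the format version of the local copy, or raise `ValueError` if the header is wrong.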
Ollama
An Ollama Modelfile is included for easy deployment.
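As a rough sketch of how the Modelfile might be used, the snippet below assembles the standard `ollama create` and `ollama run` commands. The local model name `gemma3-1b-query-optimizer` and the path `Modelfile` are assumptions for illustration; adjust them to match where you saved the file from this repository.

```python
from pathlib import Path


def ollama_commands(model_name="gemma3-1b-query-optimizer", modelfile="Modelfile"):
    """Return (create_cmd, run_cmd) argument lists for the Ollama CLI.

    Hypothetical helper: model_name is a local name of your choosing and
    modelfile is the assumed path to the Modelfile shipped with this repo.
    """
    create_cmd = ["ollama", "create", model_name, "-f", str(Path(modelfile))]
    run_cmd = ["ollama", "run", model_name]
    return create_cmd, run_cmd


if __name__ == "__main__":
    create_cmd, run_cmd = ollama_commands()
    if not Path("Modelfile").exists():
        print("Modelfile not found in the current directory; download it first.")
    # Print the commands so they can be reviewed before being run by hand.
    print(" ".join(create_cmd))
    print(" ".join(run_cmd))
```

Running the `create` command registers the model with the local Ollama installation, after which the `run` command starts an interactive session.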
Note
The model's BOS token behavior was adjusted for GGUF compatibility.
This model was trained 2x faster with Unsloth.
