gemma-270m-thinking-0126 : GGUF
This model was finetuned and converted to GGUF format using Unsloth.
Example usage:
- For text only LLMs:
./llama.cpp/llama-cli -hf Ma7ee7/gemma-270m-thinking-0126 --jinja - For multimodal models:
./llama.cpp/llama-mtmd-cli -hf Ma7ee7/gemma-270m-thinking-0126 --jinja
Available Model files:
gemma-3-270m-it.Q8_0.gguf
Ollama
An Ollama Modelfile is included for easy deployment.
Note
The model's BOS token behavior was adjusted for GGUF compatibility.
This was trained 2x faster with Unsloth

- Downloads last month
- 47
Hardware compatibility
Log In
to add your hardware
8-bit
Inference Providers
NEW
This model isn't deployed by any Inference Provider.
๐
Ask for provider support