How to use from the
Use from the
llama-cpp-python library
# !pip install llama-cpp-python

from llama_cpp import Llama

llm = Llama.from_pretrained(
	repo_id="marcsixtysix/gemma-3-1b-it-assembler_w",
	filename="gemma-3-1b-it-assembler_w.gguf",
)
llm.create_chat_completion(
	messages = [
		{
			"role": "user",
			"content": "What is the capital of France?"
		}
	]
)

Uploaded finetuned model

  • Developed by: marcsixtysix
  • License: apache-2.0
  • Finetuned from model : unsloth/gemma-3-1b-it
Downloads last month
45
Safetensors
Model size
1.0B params
Tensor type
BF16
·
F16
·
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for marcsixtysix/gemma-3-1b-it-assembler_w

Quantized
(17)
this model