Use with llama-cpp-python

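# !pip install llama-cpp-python huggingface-hub
# (Llama.from_pretrained downloads from the Hub and needs huggingface-hub)

from llama_cpp import Llama

# Download the GGUF file from the Hugging Face Hub and load it.
llm = Llama.from_pretrained(
    repo_id="olegshulyakov/codegemma-1.1-2b-GGUF",
    # Left empty in the original: set this to the name of a GGUF file
    # in the repo (a glob pattern should also be accepted).
    filename="",
)

# Run a plain text completion of up to 512 new tokens.
output = llm(
    "Once upon a time,",
    max_tokens=512,
    echo=True,  # include the prompt in the returned text
)
print(output)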
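
The call above prints the whole completion dictionary. As a minimal sketch, the generated text can be pulled out of the first entry under `choices`. The helper name `completion_text` and the `sample_output` values are hypothetical and only stand in for what `llm(...)` returns; the real dictionary carries more fields.

```python
from typing import Any, Dict


def completion_text(output: Dict[str, Any]) -> str:
    """Return the text of the first choice, or "" if there is none."""
    choices = output.get("choices") or []
    if not choices:
        return ""
    return choices[0].get("text", "")


# Made-up stand-in for the dict returned by llm(...), so the sketch runs
# without downloading the model.
sample_output = {
    "choices": [
        {"text": "Once upon a time, there was a compiler.", "finish_reason": "length"}
    ],
    "usage": {"prompt_tokens": 5, "completion_tokens": 6, "total_tokens": 11},
}

print(completion_text(sample_output))
```

With the real model, `completion_text(output)` would replace `print(output)`; because `echo=True` is set, the returned text starts with the prompt.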

codegemma-1.1-2b

Model creator: google
Original model: google/codegemma-1.1-2b
GGUF quantization: provided by olegshulyakov using llama.cpp

Special thanks

🙏 Special thanks to Georgi Gerganov and the whole team working on llama.cpp for making all of this possible.

Use with Ollama

ollama run "hf.co/olegshulyakov/codegemma-1.1-2b-GGUF:Q5_K_XL"

Use with LM Studio

lms load "olegshulyakov/codegemma-1.1-2b-GGUF"

Use with llama.cpp CLI

llama-cli -hf olegshulyakov/codegemma-1.1-2b-GGUF:Q5_K_XL -p "The meaning to life and the universe is"

Use with llama.cpp Server

llama-server -hf olegshulyakov/codegemma-1.1-2b-GGUF:Q5_K_XL -c 4096
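
Once the server is running, it can be queried over HTTP. The sketch below builds a request for the server's `/completion` endpoint using only the standard library. It assumes the server listens on its default address `http://localhost:8080` (no `--host` or `--port` was passed above); the helper names `build_completion_request` and `complete`, and the example prompt, are hypothetical.

```python
import json
import urllib.request


def build_completion_request(prompt, n_predict=128, base_url="http://localhost:8080"):
    """Build a POST request for llama-server's /completion endpoint.

    base_url is an assumption: it is the server's default address.
    """
    payload = {"prompt": prompt, "n_predict": n_predict}
    return urllib.request.Request(
        f"{base_url}/completion",
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def complete(prompt, **kwargs):
    """Send the request and return the generated text (needs a running server)."""
    request = build_completion_request(prompt, **kwargs)
    with urllib.request.urlopen(request) as response:
        return json.loads(response.read())["content"]


# Building the request does not touch the network.
example_request = build_completion_request("def fibonacci(n):")
print(example_request.get_method(), example_request.full_url)

# With llama-server running, the call would look like:
# print(complete("def fibonacci(n):", n_predict=64))
```

Keeping request construction separate from the network call makes the payload easy to inspect before a server is available.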

Format: GGUF
Model size: 3B params
Architecture: gemma
Quantization: 5-bit