GGUF
conversational
How to use from the
Use from the
llama-cpp-python library
# !pip install llama-cpp-python

from llama_cpp import Llama

llm = Llama.from_pretrained(
	repo_id="rahul7star/gemma-gguf",
	filename="",
)
llm.create_chat_completion(
	messages = "No input example has been defined for this model task."
)

model-apex-i-quality.gguf => is gemma4 apex

model-gemma4-heretic-apexi-quality.gguf => Gemma4 Fast Heretic Model

DEMO

https://huggingface.co/spaces/rahul7star/apex-gguf

Downloads last month
16,705
GGUF
Model size
5B params
Architecture
gemma4
Hardware compatibility
Log In to add your hardware

We're not able to determine the quantization variants.

Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐Ÿ™‹ Ask for provider support

Model tree for rahul7star/gemma-gguf

Quantized
(201)
this model

Dataset used to train rahul7star/gemma-gguf

Spaces using rahul7star/gemma-gguf 2