Use from the llama-cpp-python library
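# !pip install llama-cpp-python huggingface-hub

from llama_cpp import Llama

# Download the GGUF file from the Hugging Face Hub (cached after the first run) and load it.
llm = Llama.from_pretrained(
	repo_id="powerliftme/coach-gemma-e2b",
	filename="coach-gemma-e2b-q4.gguf",
)

# Send a single user turn and print the assistant's reply.
response = llm.create_chat_completion(
	messages=[
		{
			"role": "user",
			"content": "What is the capital of France?"
		}
	]
)
print(response["choices"][0]["message"]["content"])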

PowerliftME · Coach Chat (Gemma 4 E2B, GGUF) — heavier tier

Optional higher-quality coach variant for the PowerliftME app. Same role as the Qwen3 1.7B coach: free-form training advice (effort, recovery, technique, nutrition, refusals). Program-specific facts come from the app's deterministic rules engine, never from this model.

  • Base: Gemma 4 E2B (Google)
  • Quant: Q4_K_M (imatrix) · ~3.25 GB
  • Languages: English + Russian
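
The card states that program-specific facts come from the app's rules engine and never from the model, but it does not describe how the two are combined. The sketch below shows one possible arrangement, purely as an assumption: facts already computed by the app are placed in the prompt as read-only context, and the model is asked only for free-form advice. The function name `build_messages`, the `program_facts` dictionary, and the prompt wording are all hypothetical and not part of the PowerliftME app.

```python
def build_messages(question, program_facts):
    """Build a chat message list that carries app-supplied facts.

    Hypothetical helper: `program_facts` stands in for the output of the
    app's deterministic rules engine, which this card does not document.
    """
    # Render each fact on its own line so the model sees it verbatim.
    facts_block = "\n".join(f"- {key}: {value}" for key, value in program_facts.items())
    content = (
        "Program facts supplied by the app (treat as fixed, do not alter):\n"
        f"{facts_block}\n\n"
        f"Question: {question}"
    )
    # A single user turn avoids relying on system-role support in the chat template.
    return [{"role": "user", "content": content}]


messages = build_messages(
    "How hard should today's session feel?",
    {"today_squat": "5x5 at 100 kg"},  # illustrative value only
)
```

The resulting list can be passed as the `messages` argument of `create_chat_completion`.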

Run (llama.cpp)

llama-server -m coach-gemma-e2b-q4.gguf -c 2048
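
Once the server is running, it can be queried over HTTP through its OpenAI-compatible chat endpoint. The following is a minimal client sketch using only the Python standard library. It assumes the server listens on its default address, `127.0.0.1:8080`, because the command above sets no `--host` or `--port`. The helper names `build_payload` and `ask_coach` and the `max_tokens` default are illustrative choices, not part of this model card.

```python
import json
import urllib.request

# Assumption: llama-server was started locally with its default port (8080).
SERVER_URL = "http://127.0.0.1:8080/v1/chat/completions"


def build_payload(user_text, max_tokens=256):
    """Return the JSON body for a single-turn chat completion request."""
    return {
        "messages": [{"role": "user", "content": user_text}],
        # Keep the reply short enough to fit the 2048-token context set with -c.
        "max_tokens": max_tokens,
    }


def ask_coach(user_text, url=SERVER_URL, timeout=60):
    """POST one user message to the server and return the reply text."""
    data = json.dumps(build_payload(user_text)).encode("utf-8")
    request = urllib.request.Request(
        url,
        data=data,
        headers={"Content-Type": "application/json"},
    )
    with urllib.request.urlopen(request, timeout=timeout) as response:
        body = json.load(response)
    return body["choices"][0]["message"]["content"]
```

With the server up, `ask_coach("How long should I rest between heavy sets?")` returns the assistant's reply as a string. If the server is not reachable, `urllib` raises `URLError`.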

License

Apache 2.0, inherited from Gemma 4. Google released the Gemma 4 family under the standard Apache 2.0 license (April 2026) — no custom Gemma Terms of Use, no usage carve-outs. Same permissive terms as the Qwen models in this stack.

GGUF metadata

  • Model size: 5B params
  • Architecture: gemma4