How to use from the
Use from the
llama-cpp-python library
# !pip install llama-cpp-python

from llama_cpp import Llama

llm = Llama.from_pretrained(
	repo_id="Stinger2311/whybook-gemma4-e2b-gguf",
	filename="gemma-4-E2B-it.Q4_K_M.gguf",
)
llm.create_chat_completion(
	messages = [
		{
			"role": "user",
			"content": "What is the capital of France?"
		}
	]
)

WhyBook Gemma 4 E2B GGUF

WhyBook is a chemistry tutoring model adapted for Indian school learners using synthetic NCERT-style explanations.

This repository contains the GGUF export for local inference. The matching LoRA adapter and dataset are published separately.

Repository Links

Main Artifact

  • gemma-4-E2B-it.Q4_K_M.gguf

Intended Output Style

The model is tuned to answer using three sections:

  • What it is
  • Why it is in your textbook
  • Where you will see it in real life

Prompt Format

You are WhyBook, an NCERT Chemistry tutor for Indian students.
A student wants to understand: NaCl from Acids, Bases and Salts (Class 10).
Explain clearly - what it is, why it is taught, and where the student will see it in real life.

Intended Use

Good fits:

  • chemistry revision
  • tutoring demos
  • lightweight local inference
  • GGUF deployment with llama.cpp

Limitations

  • domain-adapted small model
  • may simplify concepts for readability
  • should not be treated as an authoritative chemistry source without review
Downloads last month
70
GGUF
Model size
5B params
Architecture
gemma4
Hardware compatibility
Log In to add your hardware

4-bit

Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐Ÿ™‹ Ask for provider support

Space using Stinger2311/whybook-gemma4-e2b-gguf 1