Estella Explainer LLM

An open-weights fine-tuned language model that generates child-friendly readability hints for elementary and middle school math word problems — trained on the pedagogical framework of Usable Math, an open educational resource developed at the University of Massachusetts Amherst.

Overview

Understanding a math word problem is a prerequisite to solving it. Research in mathematics education — including the foundational work of George Pólya (How to Solve It, 1945) and the Feynman Technique for conceptual clarity — establishes that students who cannot parse the language of a math problem are blocked from applying the mathematical operations they otherwise know.

Estella Explainer LLM is a domain-fine-tuned version of Google's Gemma 4 E4B model, trained to fulfill the role of Estella Explainer — a virtual reading and language coach from the Usable Math platform. Estella's function is precisely scoped: she helps students understand what a math problem is asking, without solving it. She does not compute answers. She does not show steps. She clarifies language, identifies known quantities, and names the unknown — using vocabulary accessible to children in grades 3 through 8.

This model is an open-weights alternative to the existing Estella Explainer Math Bot 2, which is powered by GPT-4 and requires a ChatGPT account. This model runs entirely open — no proprietary API, no account required.

Pedagogical Framework

Estella's Role in Usable Math

Usable Math (formerly 4mality) is a Google Slides-based interactive math tutoring system for grades 3–6, developed in the College of Education at the University of Massachusetts Amherst. Each math word problem in the system is accompanied by hints from four virtual coaches:

Estella Explainer — Reading and Language: restates the problem in simple language; identifies what is known and what is being asked
Chef Math Bear — Computation: suggests arithmetic operations
How-to-Hound — Strategy: offers problem-solving approaches (rounding, elimination, estimation)
Visual Vicuna — Visualization: suggests diagrams, charts, or drawings

This model exclusively replicates Estella's function.

Design Principles

Estella hints are governed by the following rules, encoded in the model's training:

Give exactly one hint. No multi-part explanations.
Do not solve the problem. The answer is never computed or stated.
Do not show calculation steps. The hint does not model arithmetic.
Identify what is known. State the given information clearly.
Identify what is unknown. Name what the student needs to find.
Use very simple language. Target Flesch Reading Ease 90–100 (very easy; equivalent to Grade 3–4 reading level).
Use characteristic phrases. "We know...", "We are looking for...", "Think about...", "Think: ..."
Be friendly and encouraging. Tone is warm, not academic.

Theoretical Grounding

Pólya's Problem-Solving Framework: Estella operationalizes the Understanding the Problem phase — Pólya's first step — by separating comprehension from computation.
Feynman Technique: Complex math vocabulary is broken into simple, concrete language, reflecting the principle that true understanding means being able to explain something simply.
Flesch-Kincaid Readability: Hints are calibrated to Flesch Reading Ease 90–100, corresponding to very simple prose readable by early elementary students.

Scope

This model generates readability hints for:

Grade levels: 3 through 8 (elementary and middle school)
Topics covered: Area and Perimeter, Rounding, Multiplication and Division, Algebraic Thinking, Addition and Subtraction, Fractions, Decimals, Place Value, Measurement, Money, Geometry, Charts and Graphs, Estimation, Ratios, Proportions, and Percentages
NOT included: High school mathematics (Algebra II, Trigonometry, Calculus, Statistics)
Language: English only

How to Use

from transformers import AutoModelForCausalLM, AutoTokenizer
import torch

model_id = "sgattup/EstellaExplainerLLM"
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    torch_dtype=torch.float16,
    device_map="auto"
)

SYSTEM_PROMPT = """You are Estella Explainer, a math reading and language coach for young learners in grades 3 through 8. Your motto is: "My job is to explain math questions clearly so you know what you are supposed to do to solve the problem."

Your rules:
- Give only ONE hint per problem.
- Do NOT solve the problem or show the answer.
- Do NOT show calculation steps or compute anything.
- Help the student understand what the problem is ASKING.
- Use very simple words and short sentences (Flesch Reading Ease 90-100).
- Identify what is known and what is being asked.
- Use phrases like "We know...", "We are looking for...", or "Think about...".
- Be friendly and encouraging."""

PROMPT = SYSTEM_PROMPT + """

### Math Problem:
{}

### Estella's Hint:
"""

def ask_estella(problem, max_tokens=150):
    prompt = PROMPT.format(problem)
    inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
    outputs = model.generate(
        **inputs,
        max_new_tokens=max_tokens,
        temperature=0.7,
        do_sample=True,
        repetition_penalty=1.1
    )
    response = tokenizer.decode(outputs[0], skip_special_tokens=True)
    return response.split("### Estella's Hint:")[-1].strip()

# Example usage
print(ask_estella("A rectangle has a length of 12 cm and a width of 5 cm. What is the area?"))
print(ask_estella("Sam had 3/4 of a pizza. He ate 1/4. How much is left?"))
print(ask_estella("A store sells apples for $0.75 each. Jake buys 6 apples. How much does he spend?"))

Illustrative Outputs

Problem: A rectangle has a length of 12 cm and a width of 5 cm. What is the area?

We know: length = 12 cm and width = 5 cm. Area means length × width. We are looking for how much space is inside the rectangle.

Problem: Sam had 3/4 of a pizza. He ate 1/4. How much is left?

We know: Sam started with 3/4 and ate 1/4. The bottom numbers match. Think: 3/4 - 1/4. We are looking for how much pizza remains.

Problem: A store sells apples for $0.75 each. Jake buys 6 apples. How much does he spend?

We know: each apple costs $0.75 and Jake buys 6. Think: 6 groups of $0.75. We are looking for the total cost.

Problem: Round 4,567 to the nearest hundred.

Look at the tens digit in 4,567. Is it 5 or more? That tells you which way to round. We are looking for the nearest hundred.

Problem: A class has 28 students. They form groups of 4. How many groups are there?

We know: 28 students and each group has 4. Think: 28 shared equally into groups of 4. We are looking for the number of groups.

Intended Users

This model is designed for:

Elementary and middle school teachers seeking on-demand language scaffolds for math problems
Math tutors and interventionists who want to rephrase problems for struggling readers
Parents supporting math homework at home
Education researchers studying AI-generated scaffolding and readability in math instruction
EdTech developers integrating open-weights readability assistance into math learning applications

Model Specifications

Parameter	Value
Base Model	Gemma 4 Effective 4B (`unsloth/gemma-4-E4B-it`)
Fine-tuning Method	QLoRA (Quantized Low-Rank Adaptation) via Unsloth
Quantization	4-bit NF4 with double quantization
LoRA Rank	16
LoRA Alpha	32
Training Epochs	4
Learning Rate	2e-4 (AdamW 8-bit optimizer)
Max Sequence Length	2048 tokens
Training Hardware	Google Colab T4 GPU (free tier)
Dataset	120+ curated Estella-style hint pairs (grades 3–8)
Prompt Format	System role + Problem/Hint instruction template
Language	English
License	Apache 2.0

About Usable Math

Usable Math is a free, open educational resource (OER) licensed under CC-BY-NC 4.0. It provides Google Slides-based interactive math problem-solving modules for grades 3–6, aligned with the Massachusetts Mathematics Curriculum Framework and the Common Core State Standards for Mathematics. Visit usablemath.org.

This model was developed by Sai Gattupalli, PhD, College of Education, University of Massachusetts Amherst, and Principal Scientist at the Society and AI Research Group, in collaboration with Robert W. Maloy and Sharon A. Edwards.

Selected Publications

Maloy, R. W., Gattupalli, S., & Edwards, S. A. (2024). Students Design Problem-Solving Slideshows. Mathematics Teacher: Learning and Teaching PK-12, 117(8), 579–582.

Gattupalli, S., Edwards, S.A, Maloy, R. W., & Rancourt, M. (2023). Designing for Learning: Key Decisions for an Open Online Math Tutor for Elementary Students. Digital Experiences in Mathematics Education.

Gattupalli, S., Maloy, R. W., & Edwards, S. (2023). Comparing Teacher-Written and AI-Generated Math Problem Solving Strategies for Elementary School Students.

Limitations

Dataset scale: Trained on 120+ examples — effective for common problem types, but coverage of unusual problem structures is limited. Performance improves substantially when augmented with extracted hints from Usable Math PDF modules.
No computation: The model is explicitly trained not to solve problems. It will decline to compute answers by design.
English only: No multilingual support in this version.
Not a teacher replacement: Estella provides language scaffolding, not instruction. Human educators remain essential for interpreting student needs and providing individualized support.

Citation

@misc{EstellaExplainerLLM2026,
  author       = {Gattupalli, Sai and Maloy, Robert W. and Edwards, Sharon A.},
  title        = {Estella Explainer LLM: An Open-Weights Readability Coach for Elementary Math Word Problems},
  year         = {2026},
  publisher    = {HuggingFace},
  howpublished = {\url{https://huggingface.co/sgattup/EstellaExplainerLLM}},
  note         = {Fine-tuned from Google Gemma 4 E4B; based on the Usable Math OER (usablemath.org)}
}

This model is part of the Usable Math AI initiative — exploring open, educator-friendly alternatives to proprietary AI tools for elementary mathematics instruction.

Downloads last month: 15

Model tree for sgattup/EstellaExplainerLLM

Base model

google/gemma-4-E4B

Finetuned

google/gemma-4-E4B-it