Model Card for Material Sanction Model

Model Details

Developed by: Mei Tan, EduNLP Lab @ Stanford University Graduate School of Education
Release Date: 2025-11-17
Paper: Tan, Mei, and Dorottya Demszky. (2025). Do As I Say: What Teachers’ Language Reveals About Classroom Management Practices. (EdWorkingPaper: 23-844). Retrieved from Annenberg Institute at Brown University: https://doi.org/10.26300/9yj6-jn52

Model Description

This model is a RoBERTa-base classifier fine-tuned to predict binary labels from teacher utterances in classroom transcripts. It was trained on 5720 annotated teacher utterances from elementary math classroom transcripts from the NCTE dataset [1]. It is intended for research on teachers' classroom discourse.

The model classifies whether a teacher utterance is an instance of material sanctioning language. Material sanctions are defined as a subset of behavior management involving consequences that are “more than telling.” These include manipulations of access to material goods or changes to bodily or social states. These include non-exclusionary consequences and exclusionary consequences (calling home and isolating in and outside of the classroom).

Intended Uses

Not intended for evaluation of teaching quality. What is appropriate in a given classroom is highly contextual and relational in a way that this model does not capture.

Data Formatting

The expected input is a single teacher utterance.

Example: "Student D, I'm gonna have you sit in the back of the room please"

Generalizability

The training data for this model come from ~200 observations sampled from the original NCTE study [2], which represents 1652 includes observations of 317 fourth- and fifth-grade mathematics classrooms across 53 schools in New England that were primarily serving low-income students of color. The utterances in this dataset are roughly sentence-length and human-transcribed.

Applying this model to new datasets generalizing to other contexts should involve validation: annotate a sample from the new data context to assess model generalizability.

[1] Demszky, D., & Hill, H. (2023). The NCTE Transcripts: A Dataset of Elementary Math Classroom Transcripts. In 18th Workshop on Innovative Use of NLP for Building Educational Applications.

[2] Kane, Thomas, Hill, Heather, and Staiger, Douglas. National Center for Teacher Effectiveness Main Study. Inter-university Consortium for Political and Social Research [distributor], 2022-06-16. https://doi.org/10.3886/ICPSR36095.v4

Downloads last month: 4

Safetensors

Model size

0.1B params

Tensor type

F32

Dataset used to train stanford-nlpxed/material_sanction_model

Collection including stanford-nlpxed/material_sanction_model

Classroom Management

Collection

Models and datasets for classifying teachers' classroom management talk moves in elementary math classroom transcripts. • 8 items • Updated Nov 20, 2025