Update README.md
README.md
CHANGED
|
@@ -1,3 +1,34 @@
|
|
| 1 |
-
---
|
| 2 |
-
license: mit
|
| 3 |
-
---
|
| 1 |
+
---
|
| 2 |
+
license: mit
|
| 3 |
+
---
|
| 4 |
+
# Model Card for Exclusionary Consequence Model
|
| 5 |
+
|
| 6 |
+
## Model Details
|
| 7 |
+
- Developed by: Mei Tan, EduNLP Lab @ Stanford University Graduate School of Education
|
| 8 |
+
- Release Date: 2025-11-17
|
| 9 |
+
- Paper: Tan, Mei, and Dorottya Demszky. (2025). Do As I Say: What Teachers’ Language Reveals About Classroom Management Practices. (EdWorkingPaper: 23-844). Retrieved from Annenberg Institute at Brown University: https://doi.org/10.26300/9yj6-jn52
|
| 10 |
+
|
| 11 |
+
## Model Description
|
| 12 |
+
This model is a RoBERTa-base classifier fine-tuned to predict binary labels from teacher utterances in classroom transcripts. It was trained on 5720 annotated teacher utterances from elementary math classroom transcripts from the NCTE dataset [1]. It is intended for research on teachers' classroom discourse.
|
| 13 |
+
|
| 14 |
+
The model classifies whether a teacher utterance is an instance of exclusionary consequence language. Exclusionary consequences are the subset of material sanctions that involve Calling Home, In-Class Isolation, or Out-of-Class Isolation.
|
| 15 |
+
|
| 16 |
+
## Intended Uses
|
| 17 |
+
This model is not intended for evaluating teaching quality. What is appropriate in a given classroom is highly contextual and relational in ways that this model does not capture.
|
| 18 |
+
|
| 19 |
+
## Data Formatting
|
| 20 |
+
The expected input is a single teacher utterance.
|
| 21 |
+
|
| 22 |
+
Example: "Student D, I'm gonna have you sit in the back of the room please"
|
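A minimal sketch of how single-utterance inputs could be passed to a classifier like this one with the Hugging Face `transformers` library. The card does not state a repository ID or label names, so `model_id` is left as a parameter the caller must supply, and the helper names `clean_utterance` and `classify_utterances` are hypothetical. The `transformers` import is deferred so the formatting helper can be used on its own.

```python
from typing import Dict, List


def clean_utterance(text: str) -> str:
    """Collapse line breaks and repeated spaces so the input is one single-line utterance."""
    return " ".join(text.split())


def classify_utterances(utterances: List[str], model_id: str) -> List[Dict]:
    """Classify each utterance independently with a text-classification pipeline.

    model_id is an assumption: pass the Hub ID or local path of this model.
    """
    # Imported lazily; requires `pip install transformers torch`.
    from transformers import pipeline

    classifier = pipeline("text-classification", model=model_id)
    cleaned = [clean_utterance(u) for u in utterances]
    # Each result is a dict with "label" and "score" keys.
    # The label names depend on the model's config and are not stated in this card.
    return classifier(cleaned, truncation=True)


example = "Student D,  I'm gonna have you sit\nin the back of the room please"
print(clean_utterance(example))
```

Inputs are passed one utterance per list element, matching the sentence-length, single-utterance format described above, rather than as concatenated multi-turn transcript text.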
| 23 |
+
|
| 24 |
+
## Generalizability
|
| 25 |
+
The training data for this model come from ~200 observations sampled from the original NCTE study [2], which includes 1652 observations of 317 fourth- and fifth-grade mathematics classrooms across 53 schools in
|
| 26 |
+
New England that were primarily serving low-income students of color. The utterances in this dataset are roughly sentence-length and human-transcribed.
|
| 27 |
+
|
| 28 |
+
Applying this model to new datasets or to other contexts should involve validation: annotate a sample from the new data context to assess model generalizability.
|
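One way the validation step could look in practice is to compare the model's predictions against human annotations on the new sample. The sketch below is illustrative only: the function name `binary_validation_metrics` is hypothetical, the labels are assumed to be coded as 1 for exclusionary consequence language and 0 otherwise, and the example label lists are made up to show the calculation, not results from this model.

```python
from typing import Dict, Sequence


def binary_validation_metrics(human: Sequence[int], model: Sequence[int]) -> Dict[str, float]:
    """Precision, recall, and F1 for the positive class (assumed to be coded as 1)."""
    if len(human) != len(model):
        raise ValueError("human and model label lists must have the same length")

    # Count agreement and disagreement on the positive class.
    tp = sum(1 for h, m in zip(human, model) if h == 1 and m == 1)
    fp = sum(1 for h, m in zip(human, model) if h == 0 and m == 1)
    fn = sum(1 for h, m in zip(human, model) if h == 1 and m == 0)

    # Fall back to 0.0 when a denominator is zero (no positive predictions or no positive labels).
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    f1 = 2 * precision * recall / (precision + recall) if (precision + recall) else 0.0
    return {"precision": precision, "recall": recall, "f1": f1}


# Made-up labels for eight utterances from a hypothetical new dataset.
human_labels = [1, 0, 0, 1, 0, 1, 0, 0]
model_labels = [1, 0, 1, 1, 0, 0, 0, 0]
metrics = binary_validation_metrics(human_labels, model_labels)
print(metrics)
```

Because exclusionary consequence language is likely rare in most transcripts, positive-class precision and recall are probably more informative than overall accuracy, and the annotated sample needs enough positive examples for these estimates to be stable.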
| 29 |
+
|
| 30 |
+
|
| 31 |
+
|
| 32 |
+
[1] Demszky, D., & Hill, H. (2023). The NCTE Transcripts: A Dataset of Elementary Math Classroom Transcripts. In 18th Workshop on Innovative Use of NLP for Building Educational Applications.
|
| 33 |
+
|
| 34 |
+
[2] Kane, Thomas, Hill, Heather, and Staiger, Douglas. National Center for Teacher Effectiveness Main Study. Inter-university Consortium for Political and Social Research [distributor], 2022-06-16. https://doi.org/10.3886/ICPSR36095.v4
|