Tutor Talk Moves Classifier (RoBERTa Large)

Model Description

This model is a fine-tuned version of roberta-large that classifies tutor utterances based on teacher talk moves from talkmoves.com. The model identifies whether a tutor's message represents one of the following three categories, or none of them:

  • Classroom management
  • Pressing for accuracy or reasoning
  • Restating or revoicing

Training Data

This model is trained on text of tutoring sessions from three different tutoring providers, annotated by two raters.

  • Inter-Rater Reliability with Krippendorff's Alpha: 0.83
  • Total Training Examples: 1,849
  • Class Distribution:
    Class Examples Percentage
    None 987 53.4%
    Classroom management 284 15.4%
    Pressing for accuracy or reasoning 474 25.6%
    Restating or revoicing 104 5.6%

Data Format

The is trained on utterances with the following format: [PRETEXT] {3 previous messages} [TEXT] {target message}, where [PRETEXT] and [TEXT] are special tokens. Names are anonymized, message text is lowercased, and leading and trailing whitespace is removed.

Example:

[PRETEXT] tutor: hello there [student]
tutor: what is the answer to this problem?
student: the answer is 6 [TEXT] tutor: why do you say the answer is 6?

Performance

Test set results (264 examples):

Class Precision Recall F1-Score Support
None 0.9368 0.9271 0.9319 192
Classroom Management 0.6800 0.7727 0.7234 22
Pressing for Accuracy or Reasoning 0.8605 0.8810 0.8706 42
Restating or Revoicing 0.3333 0.2500 0.2857 8
Macro Average 0.7027 0.7077 0.7029 264
Weighted Average 0.8850 0.8864 0.8852 264
Downloads last month
10
Safetensors
Model size
0.4B params
Tensor type
F32
ยท
Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐Ÿ™‹ Ask for provider support