--- library_name: transformers tags: [] --- # Tutor Talk Moves Classifier (RoBERTa Large) ## Model Description This model is a fine-tuned version of [roberta-large](https://huggingface.co/roberta-large) that classifies tutor utterances based on teacher talk moves from [talkmoves.com](https://www.talkmoves.com/). The model identifies whether a tutor's message represents one of the following three categories, or none of them: - Classroom management - Pressing for accuracy or reasoning - Restating or revoicing ## Training Data This model is trained on text of tutoring sessions from three different tutoring providers, annotated by two raters. - **Inter-Rater Reliability with Krippendorff's Alpha**: 0.83 - **Total Training Examples**: 1,849 - **Class Distribution**: | Class | Examples | Percentage | |-------|----------|------------| | None | 987 | 53.4% | | Classroom management | 284 | 15.4% | | Pressing for accuracy or reasoning | 474 | 25.6% | | Restating or revoicing | 104 | 5.6% | ## Data Format The is trained on utterances with the following format: `[PRETEXT] {3 previous messages} [TEXT] {target message}`, where `[PRETEXT]` and `[TEXT]` are special tokens. Names are anonymized, message text is lowercased, and leading and trailing whitespace is removed. **Example**: ``` [PRETEXT] tutor: hello there [student] tutor: what is the answer to this problem? student: the answer is 6 [TEXT] tutor: why do you say the answer is 6? ``` ## Performance Test set results (264 examples): | Class | Precision | Recall | F1-Score | Support | |-------|-----------|--------|----------|---------| | None | 0.9368 | 0.9271 | 0.9319 | 192 | | Classroom Management | 0.6800 | 0.7727 | 0.7234 | 22 | | Pressing for Accuracy or Reasoning | 0.8605 | 0.8810 | 0.8706 | 42 | | Restating or Revoicing | 0.3333 | 0.2500 | 0.2857 | 8 | | **Macro Average** | **0.7027** | **0.7077** | **0.7029** | **264** | | **Weighted Average** | **0.8850** | **0.8864** | **0.8852** | **264** |