Larxmind's picture
Updated description
ccad116 verified
---
language:
- en
- es
tags:
- education
- early-warning-system
- sklearn
- tabular-classification
- random-forest
---
# Student Dropout Predictor (Early Warning System)
## Model Description
This model is a Gradient Boosting Classifier classifier designed to predict the likelihood of student dropout (academic churn) in a virtual learning environment. It serves as the core analytical engine for an Early Warning System (EWS) that triggers proactive generative AI interventions.
- **Model Type:** Random Forest Classifier
- **Task:** Tabular Classification (Binary: Dropout vs. Retained)
- **Language:** English/Spanish feature support
- **Interpretability:** Fully compatible with SHAP (SHapley Additive exPlanations) to extract individual risk factors.
## Intended Use
This model is intended for educational institutions to identify at-risk students based on their telemetry and engagement data (clicks, VLE interactions, assignment submissions, and demographics).
It is designed to be used in conjunction with a generative AI agent (e.g., Claude 3.5 via AWS Bedrock) to draft personalized intervention emails based on the root causes of the predicted risk.
## Training Data
The model was trained on an anonymized dataset comprising student demographics, historical academic performance, and daily interaction logs with the Virtual Learning Environment (VLE).
## Features
The input features include, but are not limited to:
- `vle_clicks`: Total interactions with the digital platform.
- `attendance_days`: Active days in the system.
- `assessments_submitted`: Number of assignments handed in.
- Demographics: Age band, disability status, prior education level.
## Metrics
*(Note: Update these metrics based on your final evaluation script)*
- **Accuracy:** 0.77
- **F1-Score:** 0.78
- **ROC-AUC:** 0.84