Instructions to use amanrr48/prajna-mistake-predictor with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Scikit-learn
How to use amanrr48/prajna-mistake-predictor with Scikit-learn:
from huggingface_hub import hf_hub_download import joblib model = joblib.load( hf_hub_download("amanrr48/prajna-mistake-predictor", "sklearn_model.joblib") ) # only load pickle files from sources you trust # read more about it here https://skops.readthedocs.io/en/stable/persistence.html - Notebooks
- Google Colab
- Kaggle
PRAJNA Mistake Predictor
Logistic regression model that predicts P(mistake) per student per micro-topic for NEET/JEE exams.
Architecture
- Model: Scikit-learn LogisticRegression with StandardScaler
- 7 features: rolling_accuracy, ability_score, topic_difficulty, exam_importance, avg_time_spent, streak, exam_number
- Label: binary (1 = student accuracy < 50% on topic in exam, 0 = OK)
- Training: Exams 1-8, validation on 9, test on 10
Features
| Feature | Description | Range |
|---|---|---|
| rolling_accuracy | Avg accuracy on this topic across prior exams | 0-1 |
| ability_score | Student's subject-level baseline ability | 0-1 |
| topic_difficulty | Avg difficulty from exam DB, normalized | 0-1 |
| exam_importance | PRAJNA appearance probability for this topic | 0-1 |
| avg_time_spent | Normalized avg time on this topic | 0-1 |
| streak | Consecutive correct/wrong streak (clipped) | -1 to 1 |
| exam_number | Temporal position normalized | 0-1 |
Outputs
p_mistake: probability the student will make a mistake on this topic (0-1)feature_importances: which factors most predict mistakes
Usage
from mistake_predictor import MistakePredictor
mp = MistakePredictor()
X_train, y_train = mp.build_features(results_df, abilities_df, topic_difficulty, prajna_importance)
mp.train(X_train, y_train)
# Per-student predictions
predictions = mp.predict_for_student(results_df, abilities_df, topic_difficulty, prajna_importance, student_id="S1")
Center-View Analysis
The companion MistakeAnalyzer provides aggregate insights:
- Danger Zones: topics where error_rate > threshold AND PRAJNA appearance_probability is high
- Co-failure Patterns: P(fail B | fail A) -- correlated topic failures
- Time vs Accuracy: scatter analysis revealing conceptual gaps
License
Apache 2.0
- Downloads last month
- -
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support