File size: 3,892 Bytes
58b9f0a 1a07743 58b9f0a 1a07743 58b9f0a 1a07743 58b9f0a 1a07743 58b9f0a 1a07743 58b9f0a 1a07743 58b9f0a 1a07743 58b9f0a 1a07743 58b9f0a 1a07743 58b9f0a 1a07743 58b9f0a 1a07743 58b9f0a 1a07743 58b9f0a 1a07743 58b9f0a 1a07743 58b9f0a 1a07743 58b9f0a 1a07743 58b9f0a 1a07743 58b9f0a 1a07743 58b9f0a 1a07743 |
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 83 84 85 86 87 88 89 90 91 92 93 94 95 96 97 98 99 100 101 102 103 104 105 106 107 108 109 110 111 112 113 114 115 116 117 118 119 120 121 122 |
---
license: mit
language:
- en
library_name: scikit-learn
tags:
- predictive-maintenance
- random-forest
- binary-classification
- engine-maintenance
datasets:
- nasa-cmapss
metrics:
- accuracy
- f1
- f2
- roc-auc
---
# Engine Predictive Maintenance Model
## Model Overview
This is a **Tuned Random Forest Classifier** trained for predictive engine maintenance with SMOTE oversampling to handle class imbalance and achieve high recall for failure detection.
## Model Details
- **Model Type**: Random Forest Classifier with SMOTE Pipeline
- **Framework**: scikit-learn, imbalanced-learn
- **Task**: Binary Classification (Engine Condition: Good/Failing)
- **Input Features**: 14 engineered sensor features (RPM, pressure, temperature, etc.)
- **Output**: Probability of engine failure (0-1)
## Model Performance
### Test Set Metrics
| Metric | Score |
|--------|-------|
| Accuracy | 0.6340 |
| Precision | 0.7456 |
| Recall | 0.6366 |
| F1 Score | 0.6868 |
| F2 Score | 0.6558 |
| ROC-AUC | 0.6893 |
| Brier Score | 0.2195 |
## Key Insights
- **High Recall (0.6366)**: Detects ~64% of actual failures
- **Competitive Precision (0.7456)**: ~75% of predictions are correct
- **Strong AUC (0.6893)**: Good discrimination between failure and non-failure cases
## Intended Use
This model is designed for:
- **Predictive Maintenance**: Identify engines at risk of failure before breakdown
- **Condition Monitoring**: Support data-driven maintenance decision-making
- **Fleet Management**: Optimize maintenance scheduling and resource allocation
- **Risk Assessment**: Provide failure probability scores for maintenance prioritization
## Limitations
- Trained on historical engine data with specific sensor configurations
- Performance may vary with new sensor types or operating conditions
- Model requires regular retraining with updated failure data
- Does not capture temporal degradation patterns (time-series)
- Assumes consistent sensor calibration and operating conditions
## Training Data
- **Dataset**: Engine Predictive Maintenance Dataset
- **Total Samples**: 19,581 engines
- **Training Samples**: 13,674 (70%)
- **Test Samples**: 3,907 (20%)
- **Features**: 14 engineered features (6 raw + 8 derived)
- **Class Distribution**: Imbalanced (Good: ~63%, Failure: ~37%)
## Training Procedure
1. Data preprocessing and feature engineering
2. Train-test split (70-20-10)
3. SMOTE oversampling on training data to handle class imbalance
4. Hyperparameter tuning via GridSearchCV with 5-fold cross-validation
5. Model evaluation on held-out test set
## Hyperparameters
- **n_estimators**: 400
- **max_depth**: 12
- **min_samples_leaf**: 4
- **SMOTE k_neighbors**: 5
- **Random state**: 42
## Recommendations
1. **Threshold Tuning**: Adjust decision threshold based on cost of false positives vs. false negatives
2. **Continuous Monitoring**: Track model performance in production and retrain quarterly with new data
3. **Feature Importance**: Use SHAP or feature importance analysis to identify critical sensors
4. **Ensemble Approaches**: Consider combining with other models (XGBoost, LightGBM) for robust predictions
5. **Domain Expertise**: Combine predictions with expert knowledge for final maintenance decisions
## Citation
If you use this model, please cite:
```
@misc{predictive-maintenance-model-2026,
title={Engine Predictive Maintenance Model},
author={GreatLearning Capstone Team},
year={2026},
howpublished={Hugging Face Hub},
url={https://huggingface.co/models/nilanjanadevc/engine-predictive-maintenance-model}
}
```
## License
This model is released under the MIT License. See LICENSE file for details.
## Contact & Support
For questions or issues:
- GitHub: [Check repository](https://github.com/nilanjanadevc/predictive-engine-maintainence-mlops)
- Hugging Face: [@nilanjanadevc](https://huggingface.co/nilanjanadevc)
|