File size: 3,892 Bytes
58b9f0a
 
1a07743
 
 
 
 
 
 
 
58b9f0a
1a07743
58b9f0a
1a07743
 
 
 
58b9f0a
 
 
 
1a07743
 
58b9f0a
1a07743
 
58b9f0a
 
1a07743
 
58b9f0a
 
 
 
 
1a07743
58b9f0a
1a07743
 
 
 
 
 
 
 
 
 
 
 
58b9f0a
 
 
 
 
 
 
1a07743
58b9f0a
 
 
 
 
 
 
1a07743
58b9f0a
 
 
 
1a07743
 
 
 
58b9f0a
 
 
 
 
 
1a07743
 
 
58b9f0a
1a07743
 
 
 
 
 
58b9f0a
 
 
1a07743
 
 
 
 
58b9f0a
 
 
 
 
 
1a07743
58b9f0a
1a07743
58b9f0a
1a07743
 
58b9f0a
 
 
 
 
 
 
1a07743
58b9f0a
1a07743
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
---
license: mit
language:
  - en
library_name: scikit-learn
tags:
  - predictive-maintenance
  - random-forest
  - binary-classification
  - engine-maintenance
datasets:
  - nasa-cmapss
metrics:
  - accuracy
  - f1
  - f2
  - roc-auc
---

# Engine Predictive Maintenance Model

## Model Overview
This is a **Tuned Random Forest Classifier** trained for predictive engine maintenance with SMOTE oversampling to handle class imbalance and achieve high recall for failure detection.

## Model Details
- **Model Type**: Random Forest Classifier with SMOTE Pipeline
- **Framework**: scikit-learn, imbalanced-learn
- **Task**: Binary Classification (Engine Condition: Good/Failing)
- **Input Features**: 14 engineered sensor features (RPM, pressure, temperature, etc.)
- **Output**: Probability of engine failure (0-1)

## Model Performance

### Test Set Metrics

| Metric | Score |
|--------|-------|
| Accuracy | 0.6340 |
| Precision | 0.7456 |
| Recall | 0.6366 |
| F1 Score | 0.6868 |
| F2 Score | 0.6558 |
| ROC-AUC | 0.6893 |
| Brier Score | 0.2195 |

## Key Insights
- **High Recall (0.6366)**: Detects ~64% of actual failures
- **Competitive Precision (0.7456)**: ~75% of predictions are correct
- **Strong AUC (0.6893)**: Good discrimination between failure and non-failure cases

## Intended Use

This model is designed for:
- **Predictive Maintenance**: Identify engines at risk of failure before breakdown
- **Condition Monitoring**: Support data-driven maintenance decision-making
- **Fleet Management**: Optimize maintenance scheduling and resource allocation
- **Risk Assessment**: Provide failure probability scores for maintenance prioritization

## Limitations

- Trained on historical engine data with specific sensor configurations
- Performance may vary with new sensor types or operating conditions
- Model requires regular retraining with updated failure data
- Does not capture temporal degradation patterns (time-series)
- Assumes consistent sensor calibration and operating conditions

## Training Data

- **Dataset**: Engine Predictive Maintenance Dataset
- **Total Samples**: 19,581 engines
- **Training Samples**: 13,674 (70%)
- **Test Samples**: 3,907 (20%)
- **Features**: 14 engineered features (6 raw + 8 derived)
- **Class Distribution**: Imbalanced (Good: ~63%, Failure: ~37%)

## Training Procedure

1. Data preprocessing and feature engineering
2. Train-test split (70-20-10)
3. SMOTE oversampling on training data to handle class imbalance
4. Hyperparameter tuning via GridSearchCV with 5-fold cross-validation
5. Model evaluation on held-out test set

## Hyperparameters
- **n_estimators**: 400
- **max_depth**: 12
- **min_samples_leaf**: 4
- **SMOTE k_neighbors**: 5
- **Random state**: 42

## Recommendations

1. **Threshold Tuning**: Adjust decision threshold based on cost of false positives vs. false negatives
2. **Continuous Monitoring**: Track model performance in production and retrain quarterly with new data
3. **Feature Importance**: Use SHAP or feature importance analysis to identify critical sensors
4. **Ensemble Approaches**: Consider combining with other models (XGBoost, LightGBM) for robust predictions
5. **Domain Expertise**: Combine predictions with expert knowledge for final maintenance decisions

## Citation

If you use this model, please cite:

```
@misc{predictive-maintenance-model-2026,
  title={Engine Predictive Maintenance Model},
  author={GreatLearning Capstone Team},
  year={2026},
  howpublished={Hugging Face Hub},
  url={https://huggingface.co/models/nilanjanadevc/engine-predictive-maintenance-model}
}
```

## License

This model is released under the MIT License. See LICENSE file for details.

## Contact & Support

For questions or issues:
- GitHub: [Check repository](https://github.com/nilanjanadevc/predictive-engine-maintainence-mlops)
- Hugging Face: [@nilanjanadevc](https://huggingface.co/nilanjanadevc)