Spaces:
Sleeping
Sleeping
Predictive Maintenance ML Pipeline
Automated end-to-end machine learning pipeline for engine predictive maintenance using GitHub Actions.
Quick Start
1. Prerequisites
- GitHub account
- Hugging Face account with token
- Python 3.9+
2. Setup
Add GitHub Secret:
- Go to Repository Settings β Secrets β Actions
- Add secret:
HF_TOKEN= your Hugging Face token
Create Hugging Face Repositories:
- Dataset:
SharleyK/PredictiveMaintenance - Model:
SharleyK/engine-predictive-maintenance
3. Usage
Upload your data:
# Place engine_data.csv in data/ folder
Push to GitHub:
git add .
git commit -m "Initial commit"
git push origin main
The pipeline will run automatically!
Pipeline Overview
- Data Registration - Upload data to Hugging Face
- EDA - Exploratory data analysis
- Data Preparation - Clean and split data
- Model Training - Train 6 models with MLflow
- Decision Tree
- Bagging
- Random Forest
- AdaBoost
- Gradient Boosting
- XGBoost
- Model Registration - Upload best model to HF
Project Structure
predictive-maintenance-pipeline/
βββ .github/workflows/pipeline.yml # GitHub Actions workflow
βββ scripts/ # Python scripts
βββ data/ # Data files
βββ models/ # Trained models
βββ outputs/ # Results
βββ requirements.txt # Dependencies
Results
After pipeline execution:
- Data available at:
https://huggingface.co/datasets/SharleyK/PredictiveMaintenance - Model available at:
https://huggingface.co/SharleyK/engine-predictive-maintenance - MLflow experiments in
mlruns/folder
Documentation
- Full implementation guide in repository
- MLflow UI:
mlflow ui --backend-store-uri file:./mlruns
Support
For issues, create a GitHub issue in this repository.