VoltageVagabond/spam-email-dataset
Viewer β’ Updated β’ 4k β’ 2.06k
This repository was created for a senior project in ENGT 375 Applied Machine Learning at Old Dominion University. It is provided for educational and research demonstration purposes only. It is not intended for production use, security filtering, or making real-world spam/phishing decisions. Always use established security tools for operational email protection.
A Gradio web app that classifies emails as spam or ham and provides explainable AI (XAI) insights using three different methods.
pip install -r requirements.txt
python train.py # train the models (first time only)
python app.py # launch the Gradio app
python retrain.py # retrain with accumulated feedback corrections
python retrain.py --no-feedback # retrain with original data only
Voting ensemble (Random Forest + Logistic Regression + SVM) trained on SpamAssassin + Enron email datasets using TF-IDF + 24 metadata features.