metadata
metadata:
language: en
library_name: scikit-learn
pipeline_tag: tabular-classification
inference: true
tags:
- diabetes-prediction
- healthcare
- classification
- sklearn
license: mit
author: Thyagharajan K K
flags:
- cpu
- allow_fork
- allow_metrics
emoji: π©Ί
colorFrom: green
colorTo: pink
π©Ί Diabetes Prediction Models
This directory contains machine learning models for diabetes prediction using structured clinical features.
The models are trained on standardized tabular data and evaluated using stratified K-fold cross-validation with robust statistical reporting.
## π Directory Structure
diabetic_models/
β
βββ README.md
βββ CITATION.cff
βββ LICENSE
βββ logistic_reg_diabetic.pkl
β
βββ SGD_scaler_diabetic_models/
βββ README.md
βββ SGD_model.pkl
βββ SGD_scaler.pkl
π§ Available Models
πΉ Baseline Model
- Standard Logistic Regression
- Batch optimization
- Used for benchmarking
πΉ Primary Model
- SGD-based Logistic Regression
- Supports:
- Epoch-wise validation
- Early stopping
- Adaptive learning-rate scheduling
- Serialized with its preprocessing pipeline
βοΈ Preprocessing
All models rely on feature standardization using StandardScaler.
The scaler is stored separately and must be applied during inference.
π License
This repository is released under the MIT License.
See the LICENSE file for full terms.
π Citation
If you use these models in academic work, please cite using the CITATION.cff file.
π€ Author
Dr. Thyagharajan K K
Professor & Dean (Research), RMD Engineering College
π§ kkthyagharajan@yahoo.com