Instructions to use pfizer-project-team/binary-segA-vs-segBC with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Scikit-learn
How to use pfizer-project-team/binary-segA-vs-segBC with Scikit-learn:
from huggingface_hub import hf_hub_download import joblib model = joblib.load( hf_hub_download("pfizer-project-team/binary-segA-vs-segBC", "sklearn_model.joblib") ) # only load pickle files from sources you trust # read more about it here https://skops.readthedocs.io/en/stable/persistence.html - Notebooks
- Google Colab
- Kaggle
| license: other | |
| library_name: sklearn | |
| tags: | |
| - binary-classification | |
| - tabular-classification | |
| - healthcare | |
| - physician-segmentation | |
| # Binary SEG_A vs SEG_B/C Classifier | |
| This repository contains the selected binary classifier for the first stage of a hierarchical physician segmentation strategy. | |
| ## Task | |
| Binary classification: | |
| - `0`: SEG_A | |
| - `1`: SEG_B/C | |
| The model predicts whether a physician belongs to `SEG_A` or should be routed to the second-stage `SEG_B` vs `SEG_C` classifier. | |
| ## Selected Model | |
| Best model: `HistGradientBoosting` | |
| Decision threshold for SEG_B/C: `0.45` | |
| ## Files | |
| - `best_binary_segA_vs_segBC.joblib`: trained model | |
| - `model_metadata.json`: model configuration and selected threshold | |
| - `binary_model_threshold_comparison_validation.csv`: validation threshold comparison | |
| - `test_predictions_binary_segA_vs_segBC_with_hcp_id.csv`: test-set predictions with HCP ID | |
| ## Notes | |
| The model uses flattened temporal tensors as input. Each physician is represented by weekly behavior across multiple features. | |
| The prediction probability `prob_SEG_BC` can be used to decide whether a physician should be classified as `SEG_A` or passed to the next B/C decision model. | |