Wipoba
/

user-clustering-model

Model card Files Files and versions

user-clustering-model / README.md

Wipoba's picture

Add clustering models, metadata, and README

aa29186 7 months ago

|

history blame contribute delete

1.02 kB

User Clustering Model

This repository contains models and artifacts for a user clustering pipeline.

Models

Preprocessor (OneHotEncoder + StandardScaler)
UMAP reducer for dimensionality reduction
KMeans clustering model with k=15

Metrics

Best silhouette score on training: 0.4733
Recommended silhouette score threshold for triggering auto retrain: 0.4

Files

preprocessor.joblib : preprocessing pipeline
umap_reducer.joblib : UMAP reducer
kmeans_model.joblib : KMeans model
top_categories.json : top categories for cardinality limiting
cluster_sizes.png : cluster distribution plot
metadata.json : metadata JSON with metrics and parameters

Usage

Load the models using joblib.load(), preprocess incoming data with the preprocessor, transform with UMAP, then predict clusters using KMeans.

Auto retrain can be triggered if silhouette score on new data falls below 0.4.

License

Specify your license here.

Generated and pushed by your clustering pipeline.