Binary classifier for track-level YouTube virality (viral) from Spotify + Librosa audio features; stacking combines tuned RF, XGBoost, and LightGBM with a logistic meta-learner.
Stacking ensemble (combined features)
- Level 0: tuned Random Forest, XGBoost, and LightGBM, each producing P(viral) on the same 95-feature inputs (Spotify + Librosa audio; no direct YouTube engagement inputs).
- Level 1: logistic regression combines those three probabilities.
The uploaded stacking_ensemble_combined.joblib artifact is only the logistic meta-learner; production inference requires rf_combined.joblib, xgb_combined.joblib, and lgbm_combined.joblib plus feature_info_combined.joblib (and the same preprocessing as training). Primary reported test metrics for the full stack: AUC โ 0.79, accuracy โ 0.80 on the stratified holdout split.
Deployment:
- Load feature_info_combined.joblib for column order
- Rebuild X with the same exclusions
- Apply the same mean imputation as training
- Load rf_combined.joblib, xgb_combined.joblib, lgbm_combined.joblib, and stacking_ensemble_combined.joblib
- Inference pipeline = three predict_proba[:,1] โ stack โ meta-learner.
Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐ Ask for provider support