ExtraaLearn Lead Conversion Prediction — Full End‑to‑End Notebook

This notebook performs:

Load Dataset

Basic Data Cleaning

Exploratory Data Analysis (EDA)

Preprocessing Pipeline

Train-Test Split

Baseline Model Training

Hyperparameter Tuning

Final Model Evaluation

Serialize Final Model

🔍 Actionable Insights & Business Recommendations

(Based on EDA findings and final model performance)


📌 I. Key Data‑Driven Insights

1. Profile Completion Is the Strongest Conversion Signal


2. Website-Based First Interaction Converts Far Better Than Mobile App


3. Last Interaction Type Strongly Predicts Lead Warmth

Conversion rates by last_activity:

Website & email interactions reflect active interest, while phone interactions lag behind.


4. Engagement Depth Influences Conversion

Correlation with target:

Longer sessions suggest research behavior typical of high-intent users.


5. Occupation Impacts Conversion Probability

Conversion rates:

Professionals show higher readiness and ability to pay, making them a key target group.


📌 II. Model‑Based Insights

The final tuned model (Gradient Boosting Classifier) achieved:

This confirms the model is highly reliable at identifying leads with strong conversion likelihood.


📌 III. Actionable Business Recommendations

1. Optimize the Profile Completion Journey

Since profile completion has the highest impact:


2. Focus Marketing Spend on Website Traffic

Because website-origin leads convert 4× better:


3. Prioritize Email & Web-Active Leads in Sales Routing

Assign higher priority to:


4. Use Behavioral Signals for Personalized Outreach


5. Create Occupation-Based Lead Personas


6. Deploy Probability-Based Lead Tiering

Using model-generated conversion probabilities:

Tier Probability Range Action
Hot Leads p ≥ 0.80 Immediate sales call / WhatsApp follow-up
Warm Leads 0.50 ≤ p < 0.80 Email + webinar invites + remarketing
Cold Leads p < 0.50 Long-term nurture campaigns

This maximizes sales efficiency and reduces CAC.


7. Retrain & Monitor Model Performance Regularly


📌 IV. Strategic Takeaways

Profile completion, website entry, and deeper engagement consistently indicate high intent.
Professionals and website-origin leads are the highest-value groups.
Your model effectively identifies warm/hot leads, enabling smarter sales operations.
Targeted interventions based on these insights can significantly boost overall conversion rates.


🔗 Hugging Face Docker Space URL for Backend Flask API's

Hugging Face Backend Space URL:
https://huggingface.co/spaces/manoj112025/extraaLearnModel

🔗 Hugging Face Docker Space URL for Streamlit ExtraLearn Model for frontend space

Hugging Face Frontend Space URL:
https://huggingface.co/spaces/manoj112025/StreamLitExtraLearnFrontendModel