Spaces:
Running
Running
File size: 6,104 Bytes
854c114 | 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 83 84 85 86 87 88 89 90 91 92 93 94 95 96 97 98 99 100 101 102 103 104 105 106 107 108 109 110 111 112 113 114 115 116 117 118 119 120 121 122 123 124 125 126 127 128 129 130 131 132 133 134 135 136 137 138 139 140 141 142 143 144 145 146 147 148 149 150 151 152 153 154 155 156 157 158 159 160 161 162 163 164 | # π Complete Machine Learning & Data Science Curriculum
## **26 Modules β’ From Zero to Production-Ready ML Engineer**
Welcome to the most comprehensive, hands-on Data Science practice curriculum ever created. This series takes you from **Core Python** to deploying **production ML systems**.
---
## π **Curriculum Structure**
### **π Phase 1: Foundations (Modules 01-02)**
1. **[01_Python_Core_Mastery.ipynb](./01_Python_Core_Mastery.ipynb)**
- **Basics**: Strings, F-Strings, Slicing, Data Structures
- **Intermediate**: Comprehensions, Generators, Decorators
- **Advanced**: OOP (Dunder Methods, Static Methods), Async/Await
- **Expert**: Multithreading vs Multiprocessing (GIL), Singleton Pattern
2. **[02_Statistics_Foundations.ipynb](./02_Statistics_Foundations.ipynb)**
- Central Tendency, Dispersion, Z-Scores
- Correlation, Hypothesis Testing (p-values)
- Links: [Statistics Course](https://aashishgarg13.github.io/DataScience/complete-statistics/)
---
### **π§ Phase 2: Data Science Toolbox (Modules 03-07)**
3. **[03_NumPy_Practice.ipynb](./03_NumPy_Practice.ipynb)** - Numerical Computing
4. **[04_Pandas_Practice.ipynb](./04_Pandas_Practice.ipynb)** - Data Manipulation
5. **[05_Matplotlib_Seaborn_Practice.ipynb](./05_Matplotlib_Seaborn_Practice.ipynb)** - Visualization
6. **[06_EDA_and_Feature_Engineering.ipynb](./06_EDA_and_Feature_Engineering.ipynb)** - Real Titanic Dataset
7. **[07_Scikit_Learn_Practice.ipynb](./07_Scikit_Learn_Practice.ipynb)** - Pipelines & GridSearch
---
### **π€ Phase 3: Supervised Learning (Modules 08-14)**
8. **[08_Linear_Regression.ipynb](./08_Linear_Regression.ipynb)** - Diamonds Dataset
9. **[09_Logistic_Regression.ipynb](./09_Logistic_Regression.ipynb)** - Breast Cancer Dataset
10. **[10_Support_Vector_Machines.ipynb](./10_Support_Vector_Machines.ipynb)** - Kernel Trick
11. **[11_K_Nearest_Neighbors.ipynb](./11_K_Nearest_Neighbors.ipynb)** - Iris Dataset
12. **[12_Naive_Bayes.ipynb](./12_Naive_Bayes.ipynb)** - Text Classification
13. **[13_Decision_Trees_and_Random_Forests.ipynb](./13_Decision_Trees_and_Random_Forests.ipynb)** - Penguins Dataset
14. **[14_Gradient_Boosting_XGBoost.ipynb](./14_Gradient_Boosting_XGBoost.ipynb)** - Kaggle Champion
---
### **π Phase 4: Unsupervised Learning (Modules 15-16)**
15. **[15_KMeans_Clustering.ipynb](./15_KMeans_Clustering.ipynb)** - Elbow Method
16. **[16_Dimensionality_Reduction_PCA.ipynb](./16_Dimensionality_Reduction_PCA.ipynb)** - Digits Dataset
---
### **π§ Phase 5: Advanced ML (Modules 17-20)**
17. **[17_Neural_Networks_Deep_Learning.ipynb](./17_Neural_Networks_Deep_Learning.ipynb)** - MNIST with MLPClassifier
18. **[18_Time_Series_Analysis.ipynb](./18_Time_Series_Analysis.ipynb)** - Air Passengers Dataset
19. **[19_Natural_Language_Processing_NLP.ipynb](./19_Natural_Language_Processing_NLP.ipynb)** - Sentiment Analysis
20. **[20_Reinforcement_Learning_Basics.ipynb](./20_Reinforcement_Learning_Basics.ipynb)** - Q-Learning Grid World
---
### **πΌ Phase 6: Industry Skills (Modules 21-23)**
21. **[21_Kaggle_Project_Medical_Costs.ipynb](./21_Kaggle_Project_Medical_Costs.ipynb)** - Full Pipeline
22. **[22_SQL_for_Data_Science.ipynb](./22_SQL_for_Data_Science.ipynb)** - Database Integration
23. **[23_Model_Explainability_SHAP.ipynb](./23_Model_Explainability_SHAP.ipynb)** - XAI with SHAP
---
### **π Phase 7: Production & Deployment (Modules 24-26)** β NEW!
24. **[24_Deep_Learning_TensorFlow.ipynb](./24_Deep_Learning_TensorFlow.ipynb)** - TensorFlow/Keras & CNNs
25. **[25_Model_Deployment_Streamlit.ipynb](./25_Model_Deployment_Streamlit.ipynb)** - Web App Deployment
26. **[26_End_to_End_ML_Project.ipynb](./26_End_to_End_ML_Project.ipynb)** - Production Pipeline
---
## π οΈ **Setup Instructions**
### **1. Install Dependencies**
```bash
pip install -r requirements.txt
```
### **2. Launch Jupyter**
```bash
jupyter notebook
```
### **3. Start Learning!**
Open `01_Python_Core_Mastery.ipynb` and work sequentially through Module 26.
---
## π **Website Integration**
This curriculum is designed to work seamlessly with the **[DataScience Learning Hub](https://aashishgarg13.github.io/DataScience/)**. Each ML module links to interactive visualizations and theory.
---
## π **What Makes This Curriculum Unique?**
β
**26 Complete Modules** - From Python basics to production deployment
β
**Real Datasets** - Titanic, MNIST, Kaggle Insurance, and more
β
**Website Integration** - Links to visual demos for every concept
β
**Industry-Ready** - Includes SQL, SHAP, Design Patterns, Async programming
β
**Production Skills** - TensorFlow, Streamlit, Model Deployment
β
**Git-Ready** - Initialized with version control
---
## π **Key Files**
- **[CURRICULUM_REVIEW.md](./CURRICULUM_REVIEW.md)** - Quality assessment of all modules
- **[README_Resources.md](./README_Resources.md)** - External learning resources
- **[requirements.txt](./requirements.txt)** - All dependencies
---
## π― **Who Is This For?**
- π **Students** learning Data Science from scratch
- πΌ **Professionals** preparing for DS/ML interviews
- π§βπ» **Developers** transitioning to ML engineering
- π **Kagglers** wanting structured practice
---
## π **Learning Path**
**Beginner** (Weeks 1-4): Modules 01-07
**Intermediate** (Weeks 5-8): Modules 08-16
**Advanced** (Weeks 9-12): Modules 17-23
**Expert** (Weeks 13-14): Modules 24-26
---
## π **After Completion**
You will be able to:
- β
Build end-to-end ML systems
- β
Deploy models as web applications
- β
Compete in Kaggle competitions
- β
Pass ML engineering interviews
- β
Explain model decisions with SHAP
---
## π€ **Contributing**
This curriculum is part of a personal learning journey integrated with [aashishgarg13.github.io/DataScience/](https://aashishgarg13.github.io/DataScience/).
---
## π **License**
For educational purposes. Feel free to learn and adapt!
---
**Ready to become a Machine Learning Engineer?** Start with `01_Python_Core_Mastery.ipynb`! π
|