diff --git "a/math-ds-complete/index.html" "b/math-ds-complete/index.html" --- "a/math-ds-complete/index.html" +++ "b/math-ds-complete/index.html" @@ -16,6 +16,7 @@ 📐 Linear Algebra ∫ Calculus 🤖 Data Science + 🚀 Machine Learning @@ -233,6 +234,89 @@ 85. Loss Functions + + + + Supervised Learning - Regression + + ML-1. Linear Regression + ML-2. Polynomial Regression + ML-3. Ridge Regression (L2) + ML-4. Lasso Regression (L1) + ML-5. Elastic Net + ML-6. Support Vector Regression + + + + + Supervised Learning - Classification + + ML-7. Logistic Regression + ML-8. K-Nearest Neighbors + ML-9. Support Vector Machines + ML-10. Decision Trees + ML-11. Naive Bayes + ML-12. Random Forest + ML-13. Gradient Boosting + ML-14. Neural Networks + + + + + Unsupervised - Clustering + + ML-15. K-Means Clustering + ML-16. Hierarchical Clustering + ML-17. DBSCAN + ML-18. Gaussian Mixture Models + + + + + Unsupervised - Dim. Reduction + + ML-19. PCA + ML-20. t-SNE + ML-21. Autoencoders + + + + + Reinforcement Learning + + ML-22. Q-Learning + ML-23. Deep Q-Networks + ML-24. Policy Gradient + + + + + Model Evaluation & Optimization + + ML-25. Cross-Validation + ML-26. GridSearch & RandomSearch + ML-27. Hyperparameter Tuning + ML-28. Model Evaluation Metrics + ML-29. Regularization + ML-30. Bias-Variance Tradeoff + + + + + Advanced Topics + + ML-31. Ensemble Methods + ML-32. Feature Engineering + ML-33. Imbalanced Data + ML-34. Time Series Analysis + ML-35. Anomaly Detection + ML-36. Transfer Learning + ML-37. Fine-tuning Models + ML-38. Model Interpretability + ML-39. Optimization Algorithms + ML-40. Batch Norm & Dropout + + @@ -7547,6 +7631,2072 @@ + + + + + + ML Algorithm 1 + 📈 Linear Regression + Predicting continuous values with a straight line + + + + 📚 What is Linear Regression? 
+ Linear regression is one of the simplest supervised learning algorithms; it models the relationship between input features and a continuous output variable using a straight line (in 2D) or a hyperplane (in higher dimensions). + Analogy: Like drawing the best-fit line through scattered points on a graph to predict future values based on the trend. + + + + 💡 How It Works + Step-by-step intuition: + + Plot your data points on a graph + Find the line that passes closest to the points overall + Use "least squares" - minimize the sum of squared vertical errors + Calculate the optimal slope and intercept mathematically + Use the line to predict new values + + + + + 🧮 Mathematics Behind It + + Equation + y = β₀ + β₁x + ε + β₀ = intercept, β₁ = slope, ε = error + + + Slope Calculation + β₁ = Σ(xᵢ-x̄)(yᵢ-ȳ) / Σ(xᵢ-x̄)² + + + Intercept Calculation + β₀ = ȳ - β₁x̄ + + + Cost Function (MSE) + J = (1/n)Σ(yᵢ - ŷᵢ)² + + + + + 📝 Worked Example - Predicting House Prices + + + Problem: + A real estate company has data on 5 houses. Predict the price of a 2500 sq ft house. + + Size (sq ft) | Price ($1000s) + 1000 | 150 + 1500 | 200 + 2000 | 250 + 2500 | ?
+ 3000 | 350 + + + + + Solution: + + + Step 1: + + Calculate Means + + x̄ = (1000 + 1500 + 2000 + 3000) / 4 = 1875 sq ft + ȳ = (150 + 200 + 250 + 350) / 4 = 237.5 ($1000s) + + We exclude the house we're predicting from training + + + + + Step 2: + + Calculate Deviations + + (x - x̄): -875, -375, 125, 1125 + (y - ȳ): -87.5, -37.5, 12.5, 112.5 + + Find how much each point differs from the mean + + + + + Step 3: + + Calculate Slope (β₁) + + Numerator: (-875)(-87.5) + (-375)(-37.5) + (125)(12.5) + (1125)(112.5) + = 76562.5 + 14062.5 + 1562.5 + 126562.5 = 218750 + Denominator: (-875)² + (-375)² + (125)² + (1125)² + = 765625 + 140625 + 15625 + 1265625 = 2187500 + β₁ = 218750 / 2187500 = 0.10 + + Slope tells us price change per sq ft + + + + + Step 4: + + Calculate Intercept (β₀) + + β₀ = ȳ - β₁ × x̄ + β₀ = 237.5 - 0.10 × 1875 + β₀ = 237.5 - 187.5 = 50 + + Base price when size = 0 + + + + + Step 5: + + Write Prediction Equation + + Price = 50 + 0.10 × Size + For 2500 sq ft: + Price = 50 + 0.10 × 2500 = 50 + 250 = 300 + + $300,000 predicted price + + + + + Step 6: + + Calculate R² Score + + Predictions: 150, 200, 250, 350 + Residuals: 0, 0, 0, 0 (perfect fit!) + R² = 1 - (SS_res / SS_tot) = 1.0 + + R² = 1.0 means perfect linear fit + + + + + ✓ Final Prediction: + House Price = $300,000 for 2500 sq ft + Equation: Price = $50k + $0.10k × Size + + + + Validation: + The model fits perfectly (R²=1.0). Each additional sq ft adds $100 to the price. The $50k base price represents fixed costs. + + + + + 💪 Practice Problems: + + What would a 3500 sq ft house cost? + If price is $275k, estimate the house size + What does the slope 0.10 mean in real terms?
+ + Show Answers + + Answers: + + $400,000 (50 + 0.10×3500 = 400) + 2250 sq ft (solve: 275 = 50 + 0.10x → x = 2250) + Each sq ft adds $100 to the price + + + + + + + ⚙️ Algorithm Details + + When to use: Linear relationship between features and target + Advantages: Simple, interpretable, fast, works well with limited data + Disadvantages: Only models linear relationships, sensitive to outliers + Hyperparameters: None (closed-form solution) + Applications: Sales forecasting, real estate, economics, trend analysis + + + + + 💻 Implementation (Python) + + from sklearn.linear_model import LinearRegression + import numpy as np + # Training data + X = np.array([[1000], [1500], [2000], [3000]]) + y = np.array([150, 200, 250, 350]) + # Create and train model + model = LinearRegression() + model.fit(X, y) + # Make prediction + prediction = model.predict([[2500]]) + print(f"Predicted price: ${prediction[0]}k") + # Model parameters + print(f"Slope: {model.coef_[0]:.3f}") + print(f"Intercept: {model.intercept_:.2f}") + + + + + 📊 Interactive Visualization + + + Fit Line + Reset + + + + + 🔍 Algorithm Comparison + + + Aspect + Linear Regression + Polynomial Regression + + + Complexity + Simple (straight line) + Complex (curved line) + + + Overfitting Risk + Low + High (with high degree) + + + Interpretability + Very easy + Moderate + + + Training Speed + Very fast + Fast + + + + + + 🎯 Key Takeaways + + Simplest ML algorithm - predicts with straight line + Minimizes squared errors (least squares method) + Closed-form solution: no iterative training needed + Best for linear relationships, interpretable coefficients + + + + + + + + ML Algorithm 8 + 🎯 K-Nearest Neighbors (KNN) + Classification by majority vote of nearest neighbors + + + + 📚 What is KNN? + K-Nearest Neighbors is a simple, non-parametric algorithm that classifies data points based on how their neighbors are classified. It finds the K closest training examples and uses majority vote. 
+ Analogy: "You are the average of the 5 people you spend the most time with." KNN says "You're similar to your closest neighbors in feature space!" + + + + 💡 How It Works + Step-by-step intuition: + + Store all training data (lazy learning) + When predicting, calculate distance to all training points + Find K closest neighbors + Take majority vote of their classes + Assign the most common class to new point + + + + + 🧮 Mathematics Behind It + + Euclidean Distance + d(p,q) = √[Σ(pᵢ - qᵢ)²] + Most common distance metric for KNN + + + Manhattan Distance + d(p,q) = Σ|pᵢ - qᵢ| + Alternative: sum of absolute differences + + + Classification Rule + ŷ = mode(y₁, y₂, ..., y_k) + Most frequent class among K neighbors + + + + + 📝 Worked Example - Classifying Iris Flowers + + + Problem: + Classify a new iris flower with sepal length=5.0cm, sepal width=3.5cm. Use K=3. + + Sepal Length | Sepal Width | Species + 5.1 | 3.5 | Setosa + 4.9 | 3.0 | Setosa + 7.0 | 3.2 | Versicolor + 6.4 | 3.2 | Versicolor + 5.0 | 3.6 | Setosa + + + + + Solution: + + + Step 1: + + Define New Point + + New flower: x_new = [5.0, 3.5] + K = 3 (we'll find 3 nearest neighbors) + + The flower we want to classify + + + + + Step 2: + + Calculate Distances to All Points + + d₁ = √[(5.0-5.1)² + (3.5-3.5)²] = √[0.01 + 0] = 0.10 + d₂ = √[(5.0-4.9)² + (3.5-3.0)²] = √[0.01 + 0.25] = 0.51 + d₃ = √[(5.0-7.0)² + (3.5-3.2)²] = √[4.0 + 0.09] = 2.02 + d₄ = √[(5.0-6.4)² + (3.5-3.2)²] = √[1.96 + 0.09] = 1.43 + d₅ = √[(5.0-5.0)² + (3.5-3.6)²] = √[0 + 0.01] = 0.10 + + Euclidean distance to each training point + + + + + Step 3: + + Sort by Distance + + + Rank | Distance | Species + 1 | 0.10 | Setosa + 2 | 0.10 | Setosa + 3 | 0.51 | Setosa + 4 | 1.43 | Versicolor + 5 | 2.02 | Versicolor + + + Select top 3 for K=3 + + + + + Step 4: + + Take Majority Vote + + 3 nearest neighbors: + Neighbor 1: Setosa (distance 0.10) + Neighbor 2: Setosa (distance 0.10) + Neighbor 3: Setosa (distance 0.51) + Vote count: Setosa = 3, Versicolor = 0 + Winner: Setosa (unanimous!)
+ + Majority class wins + + + + + Step 5: + + Make Prediction + + Predicted Class: Setosa + Confidence: 3/3 = 100% + + All neighbors agree + + + + + ✓ Final Classification: + Predicted Species = Setosa (100% confidence) + + + + Validation: + The new flower is extremely close to known Setosa examples (distances 0.10, 0.10, 0.51). The unanimous vote gives us high confidence in this classification. + + + + + 💪 Practice Problems: + + What if we used K=5 instead? Would classification change? + If distances were 0.5(Setosa), 0.6(Setosa), 0.7(Versicolor), predict class + Why is K usually chosen as odd number? + + Show Answers + + Answers: + + Would include 2 Versicolor but still 3 Setosa → Setosa wins + Setosa (2 votes vs 1) + To avoid ties in binary classification + + + + + + + ⚙️ Algorithm Details + + When to use: Non-linear decision boundaries, small-medium datasets + Advantages: Simple, no training phase, works for any decision boundary + Disadvantages: Slow prediction, memory-intensive, sensitive to irrelevant features + Hyperparameters: K (number of neighbors), distance metric, weights + Applications: Recommendation systems, pattern recognition, anomaly detection + + + + + 💻 Implementation (Python) + + from sklearn.neighbors import KNeighborsClassifier + import numpy as np + # Training data + X = np.array([[5.1,3.5], [4.9,3.0], [7.0,3.2], [6.4,3.2], [5.0,3.6]]) + y = np.array(['Setosa', 'Setosa', 'Versicolor', 'Versicolor', 'Setosa']) + # Create and train model + model = KNeighborsClassifier(n_neighbors=3) + model.fit(X, y) + # Make prediction + new_flower = np.array([[5.0, 3.5]]) + prediction = model.predict(new_flower) + proba = model.predict_proba(new_flower) + print(f"Predicted: {prediction[0]}") + print(f"Confidence: {proba[0].max():.2%}") + + + + + 🔍 Algorithm Comparison + + + Aspect + KNN + Decision Trees + + + Training Time + None (lazy learning) + Moderate + + + Prediction Time + Slow (compute all distances) + Fast (traverse tree) + + + Interpretability + 
Low + High (visual rules) + + + Feature Scaling + Required + Not required + + + + + + 🎯 Key Takeaways + + Lazy learning: no training phase, stores all data + Classification by K-nearest neighbor majority vote + Sensitive to feature scaling - always normalize! + Choose K: small K = noisy, large K = smooth boundaries + + + + + + + + ML Algorithm 10 + 🌳 Decision Trees + Tree-based decisions using feature splits + + + + 📚 What is a Decision Tree? + Decision Trees make predictions by asking a series of yes/no questions about features, creating a flowchart-like structure from root to leaves. + Analogy: Like a game of 20 Questions - each question (split) narrows down possibilities until you reach a final decision (leaf). + + + + 💡 How It Works + Step-by-step intuition: + + Start with all training data at root + Find best feature to split on (max information gain) + Split data into branches based on that feature + Recursively repeat for each branch + Stop when pure (all same class) or max depth reached + Leaves contain final predictions + + + + + 🧮 Mathematics Behind It + + Entropy (Impurity) + H(S) = -Σ pᵢ log₂(pᵢ) + pᵢ = proportion of class i. Measures disorder. + + + Information Gain + IG = H(parent) - Σ(|child|/|parent|) × H(child) + Choose split with highest information gain + + + Gini Impurity (Alternative) + Gini = 1 - Σ pᵢ² + Used by CART algorithm. Faster to compute. + + + + + 📝 Worked Example - Loan Approval Prediction + + + Problem: + Build decision tree for loan approval. Dataset: + + Income | Credit Score | Age | Approved? + High | Good | 35 | Yes + High | Good | 40 | Yes + Low | Poor | 25 | No + Low | Poor | 30 | Yes + High | Poor | 45 | No + Low | Poor | 28 | No + + + + + Solution: + + + Step 1: + + Calculate Root Entropy + + Total: 6 samples + Approved (Yes): 3/6 = 0.5 + Denied (No): 3/6 = 0.5 + H(root) = -[0.5 log₂(0.5) + 0.5 log₂(0.5)] + H(root) = -[0.5(-1) + 0.5(-1)] = 1.0 + + Maximum entropy = maximum disorder + + + + + Step 2: + + Test Split on Credit Score + + If Credit = Good: 2 Yes, 0 No → H = 0 (pure!)
+ If Credit = Poor: 1 Yes, 3 No → H = -[0.25log₂(0.25) + 0.75log₂(0.75)] + H(Poor) = -[0.25(-2) + 0.75(-0.415)] = 0.5 + 0.311 = 0.811 + Weighted avg: (2/6)×0 + (4/6)×0.811 = 0.541 + IG(Credit) = 1.0 - 0.541 = 0.459 + + Information gain from splitting on Credit Score + + + + + Step 3: + + Test Split on Income + + If Income = High: 2 Yes, 1 No → H = 0.918 + If Income = Low: 1 Yes, 2 No → H = 0.918 + Weighted: (3/6)×0.918 + (3/6)×0.918 = 0.918 + IG(Income) = 1.0 - 0.918 = 0.082 + + Income provides less information gain + + + + + Step 4: + + Choose Best Split + + IG(Credit Score) = 0.459 ← HIGHEST! + IG(Income) = 0.082 + Best first split: Credit Score + + Choose feature with highest information gain + + + + + Step 5: + + Build Tree Recursively + + Root: Credit Score = Good? + ├─ YES → Approved (pure node) + └─ NO → Split on Income + ├─ Income = High? → Denied + └─ Income = Low? → Denied (majority) + + Continue splitting until pure or stopping criterion + + + + + Step 6: + + Make Predictions + + New applicant: Credit=Good, Income=High + Follow path: Credit=Good → Approved ✓ + Decision rule: IF Credit Score is Good THEN Approve + + Traverse tree from root to leaf + + + + + ✓ Final Tree & Prediction: + Best split: Credit Score → If Good: Approved, If Poor: check Income + + + + Validation: + The tree classifies 5 of 6 training examples correctly; the one approved Low-income, Poor-credit applicant falls in a majority-Denied leaf and would need a further split (e.g. on Age). Credit Score is the most important feature with IG=0.459. + + + + + 💪 Practice Problems: + + Calculate entropy for dataset with 4 Yes, 1 No + Of two equal-sized splits, one gives children with H=0 and the other with H=0.5 - which is better? + Why might deep trees overfit? + + Show Answers + + Answers: + + H = -[0.8log₂(0.8) + 0.2log₂(0.2)] ≈ 0.722 + The H=0 split (pure children mean higher information gain)
+ Learn noise instead of signal, memorize training data + + + + + + + ⚙️ Algorithm Details + + When to use: Need interpretable model, non-linear relationships, mixed feature types + Advantages: Easy to understand, visualize, handles non-linear data, no scaling needed + Disadvantages: Prone to overfitting, unstable (small data changes = different tree) + Hyperparameters: max_depth, min_samples_split, criterion (gini/entropy) + Applications: Credit scoring, medical diagnosis, customer segmentation + + + + + 💻 Implementation (Python) + + from sklearn.tree import DecisionTreeClassifier + from sklearn import tree + import matplotlib.pyplot as plt + # Create and train + model = DecisionTreeClassifier(max_depth=3, criterion='entropy') + model.fit(X_train, y_train) + # Predict + predictions = model.predict(X_test) + # Visualize tree + tree.plot_tree(model, filled=True, feature_names=['Income','Credit','Age']) + + + + + 🎯 Key Takeaways + + Builds tree by recursively splitting on best features + Uses entropy or Gini to measure split quality + Highly interpretable - can visualize decision rules + Prone to overfitting - use pruning or ensemble methods + + + + + + + + ML Algorithm 15 + 🎯 K-Means Clustering + Partitioning data into K distinct clusters + + + + 📚 What is K-Means? + K-Means is an unsupervised learning algorithm that groups similar data points into K clusters by minimizing within-cluster variance. + Analogy: Organizing a messy room by grouping similar items together. K-Means finds natural groupings in unlabeled data. 
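Before the step-by-step walkthrough, here is the whole procedure in code - a minimal NumPy sketch of K-Means on the customer data from the worked example below (illustrative only; the scikit-learn one-liner appears in the Implementation block):

```python
import numpy as np

# Customer data from the worked example: [Age, Income in $k]
X = np.array([[25, 40], [30, 50], [28, 45],
              [55, 80], [60, 90], [52, 75]], dtype=float)

# Initialize two centroids at customers A and E, as in the example
centroids = X[[0, 4]].copy()

for _ in range(10):
    # Assignment step: each point goes to its nearest centroid
    dists = np.linalg.norm(X[:, None, :] - centroids[None, :, :], axis=2)
    labels = dists.argmin(axis=1)
    # Update step: each centroid moves to the mean of its assigned points
    new_centroids = np.array([X[labels == k].mean(axis=0) for k in range(2)])
    if np.allclose(new_centroids, centroids):  # converged: centroids stopped moving
        break
    centroids = new_centroids

print(labels)     # [0 0 0 1 1 1] - customers A-C vs D-F
print(centroids)  # final centers ≈ [27.67, 45] and [55.67, 81.67]
```

This reproduces the hand computation in the worked example: two clusters {A, B, C} and {D, E, F}, converging after a single centroid update.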
+ + + + 💡 How It Works + Step-by-step intuition: + + Choose K (number of clusters) + Randomly initialize K cluster centers (centroids) + Assignment: Assign each point to nearest centroid + Update: Recalculate centroids as mean of assigned points + Repeat steps 3-4 until convergence (centroids don't move) + + + + + 🧮 Mathematics Behind It + + Objective Function (Minimize) + J = ΣΣ ||xᵢ - μₖ||² + Sum of squared distances from points to centroids + + + Centroid Update + μₖ = (1/|Cₖ|) Σ xᵢ + Mean of all points assigned to cluster k + + + Assignment Rule + Cₖ = {xᵢ : ||xᵢ - μₖ|| ≤ ||xᵢ - μⱼ|| for all j} + Assign to nearest centroid + + + + + 📝 Worked Example - Customer Segmentation + + + Problem: + Cluster 6 customers into K=2 groups based on [Age, Income]. Data: + + Customer | Age | Income ($k) + A | 25 | 40 + B | 30 | 50 + C | 28 | 45 + D | 55 | 80 + E | 60 | 90 + F | 52 | 75 + + + + + Solution: + + + Step 1: + + Initialize K=2 Random Centroids + + C₁ (initial) = [25, 40] (customer A) + C₂ (initial) = [60, 90] (customer E) + + Start with random points or use K-means++ + + + + + Step 2: + + Assign Points to Nearest Centroid + + Distance from A to C₁: √[(25-25)² + (40-40)²] = 0 + Distance from A to C₂: √[(25-60)² + (40-90)²] = √[1225+2500] = 61.0 + A → Cluster 1 (closer to C₁) + Similarly calculate for all: + B [30,50] → C₁ (dist=11.2 vs 50.0) + C [28,45] → C₁ (dist=5.8 vs 55.2) + D [55,80] → C₂ (dist=50.0 vs 11.2) + E [60,90] → C₂ (dist=0) + F [52,75] → C₂ (dist=44.2 vs 17.0) + Cluster 1: {A, B, C} + Cluster 2: {D, E, F} + + Each point goes to its nearest centroid + + + + + Step 3: + + Recalculate Centroids + + New C₁ = mean of {A, B, C} + Age: (25 + 30 + 28)/3 = 27.67 + Income: (40 + 50 + 45)/3 = 45 + C₁ = [27.67, 45] + New C₂ = mean of {D, E, F} + Age: (55 + 60 + 52)/3 = 55.67 + Income: (80 + 90 + 75)/3 = 81.67 + C₂ = [55.67, 81.67] + + Centroids move to center of their clusters + + + + + Step 4: + + Check Convergence + + Re-assign with new centroids: + All points stay in same clusters!
+ Centroids don't change → CONVERGED ✓ + + Algorithm stops when assignments don't change + + + + + Step 5: + + Calculate Within-Cluster Sum of Squares + + WCSS₁ = Σ dist² to final C₁ [27.67, 45] = 32.1 + 30.4 + 0.1 = 62.7 + WCSS₂ = Σ dist² to final C₂ [55.67, 81.67] = 3.2 + 88.2 + 57.9 = 149.3 + Total WCSS = 212.0 + + Measures cluster compactness (lower = better); computed from the final centroids, not the initial guesses + + + + + ✓ Final Clusters: + Cluster 1 (Young): A, B, C (avg age 28, income $45k) + Cluster 2 (Mature): D, E, F (avg age 56, income $82k) + + + + Validation: + Algorithm converged after one centroid update. Clear separation: younger customers with lower income vs older customers with higher income. + + + + + 💪 Practice Problems: + + New customer: Age=32, Income=$55k. Which cluster? + How would you choose optimal K value? + What happens if we use K=3 instead? + + Show Answers + + Answers: + + Cluster 1 (closer to [27.67, 45]) + Elbow method: plot WCSS vs K, find "elbow" + Would create 3 segments, may overfit with only 6 points + + + + + + + ⚙️ Algorithm Details + + When to use: Unlabeled data, need to find natural groupings, spherical clusters + Advantages: Simple, fast, scales well, works with large datasets + Disadvantages: Must choose K, sensitive to initialization, assumes spherical clusters + Hyperparameters: K (number of clusters), max_iter, initialization method + Applications: Customer segmentation, image compression, document clustering + + + + + 💻 Implementation (Python) + + from sklearn.cluster import KMeans + import numpy as np + # Customer data: [Age, Income in $k] + X = np.array([[25, 40], [30, 50], [28, 45], [55, 80], [60, 90], [52, 75]]) + # Create model + kmeans = KMeans(n_clusters=2, random_state=42) + kmeans.fit(X) + # Get predictions + labels = kmeans.labels_ + centroids = kmeans.cluster_centers_ + # Predict for new point + new_customer = np.array([[32, 55]]) + cluster = kmeans.predict(new_customer) + print(f"Assigned to cluster: {cluster[0]}") + + + + + 📊 Interactive Visualization + + + Run K-Means + Reset + + + + + 🎯 Key Takeaways + + Unsupervised algorithm: no labels needed + Iterative: assign → update → repeat until convergence + Choose K
using elbow method or silhouette score + Sensitive to initialization - use K-means++ or multiple runs + + + + + + + + ML Algorithm 25 + 🔄 Cross-Validation (K-Fold) + Reliable model evaluation technique + + + + 📚 What is Cross-Validation? + Cross-validation is a resampling technique that evaluates model performance by training and testing on different subsets of data multiple times. + Analogy: Testing a student on multiple different exams instead of just one - gives more reliable assessment of their true knowledge. + + + + 💡 How It Works (K-Fold) + Step-by-step intuition: + + Split data into K equal-sized folds + For each fold (1 to K): + • Use that fold as test set + • Use remaining K-1 folds as training set + • Train model and evaluate performance + Average performance across all K folds + This gives more reliable estimate than single train/test split + + + + + 🧮 Mathematics Behind It + + K-Fold CV Score + CV_score = (1/K) Σ Performance_k + Average performance across K folds + + + Standard Error + SE = σ / √K + σ = standard deviation of K scores + + + + + 📝 Worked Example - 5-Fold Cross-Validation + + + Problem: + Evaluate a model using 5-fold CV. Dataset has 100 samples. After running, fold accuracies are: 0.85, 0.90, 0.88, 0.87, 0.90. Calculate mean accuracy and standard error. 
+ + + + Solution: + + + Step 1: + + Understand the Setup + + Total samples: n = 100 + Number of folds: K = 5 + Each fold size: 100/5 = 20 samples + Each iteration: Train on 80, Test on 20 + + Divide data into 5 equal parts + + + + + Step 2: + + Record Fold Results + + + Fold | Accuracy + 1 | 0.85 + 2 | 0.90 + 3 | 0.88 + 4 | 0.87 + 5 | 0.90 + + + Performance on each test fold + + + + + Step 3: + + Calculate Mean Accuracy + + Mean = (0.85 + 0.90 + 0.88 + 0.87 + 0.90) / 5 + Mean = 4.40 / 5 = 0.88 + Average accuracy: 88% + + This is our best estimate of model performance + + + + + Step 4: + + Calculate Standard Deviation + + Deviations: (0.85-0.88), (0.90-0.88), (0.88-0.88), (0.87-0.88), (0.90-0.88) + = -0.03, 0.02, 0, -0.01, 0.02 + Squared: 0.0009, 0.0004, 0, 0.0001, 0.0004 + Variance = 0.0018 / 4 = 0.00045 + SD = √0.00045 = 0.021 + + Measures variability across folds + + + + + Step 5: + + Calculate Standard Error + + SE = SD / √K = 0.021 / √5 + SE = 0.021 / 2.236 = 0.0094 + SE ≈ 0.0094 or 0.94% + + Precision of our mean estimate + + + + + Step 6: + + Report Results with Confidence + + Mean accuracy: 0.88 ± 0.009 + 95% CI (approx): 0.88 ± 2×0.009 = [0.862, 0.898] + Model performs between 86.2% and 89.8% with 95% confidence + + Final performance estimate with uncertainty + + + + + ✓ Final Result: + 5-Fold CV Accuracy = 88.0% ± 0.9% + 95% CI: [86.2%, 89.8%] + + + + Validation: + Low variability (SD=0.021) indicates stable model performance. Every test fold performed similarly, suggesting the model generalizes well. + + + + + 💪 Practice Problems: + + For n=60, K=10, how many samples per fold? + 3-fold CV gives: 0.80, 0.85, 0.90. Find mean. + When should you use stratified K-fold? + + Show Answers + + Answers: + + 6 samples per fold + Mean = 0.85 (85%) + When classes are imbalanced - maintains class proportions + + + + + + + ⚙️ Algorithm Details + + When to use: Always!
Best practice for model evaluation + Advantages: Uses all data, reduces variance, detects overfitting + Disadvantages: K times slower, not for time-series (use time-series CV) + Hyperparameters: K (typically 5 or 10), stratified (yes/no) + Applications: Model selection, hyperparameter tuning, performance estimation + + + + + 💻 Implementation (Python) + + from sklearn.model_selection import cross_val_score + from sklearn.tree import DecisionTreeClassifier + model = DecisionTreeClassifier() + # 5-fold cross-validation + scores = cross_val_score(model, X, y, cv=5, scoring='accuracy') + print(f"Fold scores: {scores}") + print(f"Mean: {scores.mean():.3f}") + print(f"Std: {scores.std():.3f}") + print(f"95% CI: [{scores.mean()-2*scores.std():.3f}, {scores.mean()+2*scores.std():.3f}]") + + + + + 🎯 Key Takeaways + + K-Fold: split into K folds, test on each fold once + More reliable than single train/test split + K=5 or K=10 most common choices + Essential for comparing models and avoiding overfitting + + + + + + + + ML Algorithm 2 + 📈 Polynomial Regression + Fitting non-linear relationships with polynomial curves + + + 📚 What is Polynomial Regression? + Polynomial regression extends linear regression by adding polynomial terms (x², x³, etc.) to capture non-linear, curved relationships in data. + Analogy: When a straight line won't fit your data (like trajectory of a thrown ball), use a curved line instead! + + + + 📝 Worked Example - Temperature vs Ice Cream Sales + + + Problem: + Temperature (°C): [10, 15, 20, 25, 30]. Sales ($100s): [2, 5, 12, 22, 35]. Fit quadratic model and predict sales at 27°C. 
+ + + + Solution: + + + Step 1: + + Set Up Polynomial Model + + y = β₀ + β₁x + β₂x² + Where x = temperature, y = sales + Need to find β₀, β₁, β₂ + + + + + + Step 2: + + Create Design Matrix + + x | x² | y + 10 | 100 | 2 + 15 | 225 | 5 + 20 | 400 | 12 + 25 | 625 | 22 + 30 | 900 | 35 + + + + + + Step 3: + + Solve Using Normal Equations (simplified) + + Using least squares: β = (XᵀX)⁻¹Xᵀy + Result: β₀ = 5.0, β₁ = -0.969, β₂ = 0.0657 + + + + + + Step 4: + + Write Equation + + y = 5.0 - 0.969x + 0.0657x² + + + + + + Step 5: + + Predict at x = 27°C + + y = 5.0 - 0.969(27) + 0.0657(27)² + y = 5.0 - 26.16 + 0.0657(729) + y = 5.0 - 26.16 + 47.91 = 26.75 + Sales are in $100s, so 26.75 ≈ $2,675 + + + + + + ✓ Final Prediction: + Sales at 27°C ≈ $2,675 + + + + + 💪 Practice Problems: + + Predict sales at 22°C using the equation + Why use polynomial instead of linear here? + What degree polynomial would you recommend? + + + + + + 💻 Python Implementation + + from sklearn.preprocessing import PolynomialFeatures + from sklearn.linear_model import LinearRegression + import numpy as np + X = np.array([10, 15, 20, 25, 30]).reshape(-1, 1) + y = np.array([2, 5, 12, 22, 35]) + # Create polynomial features (degree 2) + poly = PolynomialFeatures(degree=2) + X_poly = poly.fit_transform(X) + # Fit model + model = LinearRegression() + model.fit(X_poly, y) + # Predict (y is in $100s) + X_new = poly.transform([[27]]) + print(f"Sales at 27°C: ${model.predict(X_new)[0]*100:,.0f}") + + + + + 📊 Interactive Visualization + + + + + 🎯 Key Takeaways + + Captures curved relationships between variables + Formula: y = β₀ + β₁x + β₂x² + β₃x³ + ... + Higher degree = more flexibility but risk of overfitting + Use cross-validation to select optimal degree + + + + + + + + ML Algorithm 3 + 🎯 Ridge Regression (L2 Regularization) + Preventing overfitting with L2 penalty + + + 📚 What is Ridge Regression?
+ Ridge regression adds an L2 penalty term to the loss function, shrinking coefficient magnitudes to prevent overfitting. + Formula: J = MSE + α Σβᵢ² + + + + 📝 Worked Example + + Problem: + Compare linear vs ridge regression. Data prone to overfitting. α = 0.1 + + + + Step 1: + + Linear Regression Cost + J = (1/n)Σ(y - ŷ)² + + + + Step 2: + + Ridge Cost Function + J_ridge = (1/n)Σ(y - ŷ)² + α Σβᵢ² + Penalty term shrinks large coefficients + + + + ✓ Result: + Ridge reduces overfitting by penalizing large coefficients + + + + + + 💻 Python Implementation + + from sklearn.linear_model import Ridge + model = Ridge(alpha=0.1) + model.fit(X_train, y_train) + predictions = model.predict(X_test) + + + + + 🎯 Key Takeaways + + L2 penalty shrinks coefficients + Reduces overfitting, handles multicollinearity + Hyperparameter α controls regularization strength + Never shrinks coefficients to exactly zero + + + + + + + + ML Algorithm 4 + 🎯 Lasso Regression (L1 Regularization) + Feature selection through L1 penalty + + + 📚 What is Lasso? + Lasso adds L1 penalty: J = MSE + α Σ|βᵢ|. Can shrink coefficients to exactly zero, performing automatic feature selection. + + + + 📝 Worked Example - Feature Selection + + Problem: + 5 features, but only 2 are relevant. Use Lasso with α = 0.5 + + + + Step 1: + + Linear Regression (No Penalty) + All coefficients non-zero: [3.2, 0.5, 5.1, 0.3, 0.1] + + + + Step 2: + + Apply Lasso Penalty + J = MSE + 0.5 Σ|βᵢ| + Small coefficients penalized heavily + + + + Step 3: + + Lasso Result + Coefficients: [3.1, 0, 5.0, 0, 0] + Features 2, 4, 5 eliminated!
+ + + + ✓ Result: + Lasso selected 2 important features, set others to zero + + + + + + 💻 Python Implementation + + from sklearn.linear_model import Lasso + import numpy as np + model = Lasso(alpha=0.5) + model.fit(X_train, y_train) + print(f"Non-zero features: {np.sum(model.coef_ != 0)}") + + + + + 🎯 Key Takeaways + + L1 penalty creates sparse models (many zeros) + Automatic feature selection + Use when you suspect only few features matter + Produces interpretable models with fewer features + + + + + + + + ML Algorithm 5 + ⚖️ Elastic Net + Combining L1 and L2 penalties + + + 📚 What is Elastic Net? + Combines L1 and L2: J = MSE + α₁Σ|βᵢ| + α₂Σβᵢ². Best of both Ridge and Lasso. + + + 💻 Python Implementation + from sklearn.linear_model import ElasticNet + model = ElasticNet(alpha=0.1, l1_ratio=0.5) + model.fit(X, y) + + + 🎯 Key Takeaways + + Combination of Ridge (L2) and Lasso (L1) + Two hyperparameters: alpha and l1_ratio + Often performs better than either alone + Good for correlated features + + + + + + + ML Algorithm 6 + 📉 Support Vector Regression (SVR) + Robust regression with margin tolerance + + + 📚 What is SVR? + SVR finds hyperplane that fits data within margin ε. Points outside margin contribute to loss. Robust to outliers. + + + 💻 Python Implementation + from sklearn.svm import SVR + model = SVR(kernel='rbf', C=1.0, epsilon=0.1) + model.fit(X, y) + + + 🎯 Key Takeaways + + Regression version of SVM + Defines margin of tolerance (ε) + Robust to outliers, works well with high dimensions + Uses kernel trick for non-linear relationships + + + + + + + ML Algorithm 7 + 🎯 Logistic Regression + Binary classification with sigmoid function + + + 📚 What is Logistic Regression? + Binary classification using sigmoid: P(y=1) = 1/(1+e^(-z)) where z = β₀+β₁x. Despite name, it's for classification!
+ + + 💻 Python Implementation + from sklearn.linear_model import LogisticRegression + model = LogisticRegression() + model.fit(X, y) + proba = model.predict_proba(X_new) + + + 🎯 Key Takeaways + + Classification algorithm despite "regression" name + Outputs probability via sigmoid function + Threshold at 0.5 for binary decisions + See Data Science Topic 72 for complete details + + + + + + + ML Algorithm 9 + 🎯 Support Vector Machines (SVM) + Maximum margin classification + + + 📚 What is SVM? + SVM finds hyperplane that maximally separates classes. Uses support vectors (closest points) and kernel trick for non-linear boundaries. + + + 💻 Python Implementation + from sklearn.svm import SVC + model = SVC(kernel='rbf', C=1.0, gamma='auto') + model.fit(X, y) + + + 📊 Interactive Visualization + + + + 🎯 Key Takeaways + + Maximizes margin between classes + Uses kernel trick for non-linear boundaries (RBF, polynomial) + Effective in high dimensions, memory efficient + Support vectors are the critical training examples + + + + + + + ML Algorithm 11 + 📊 Naive Bayes + Probabilistic classifier using Bayes' Theorem + + + 📚 What is Naive Bayes? + Applies Bayes' Theorem with "naive" independence assumption. P(y|x) ∝ P(y)ΠP(xᵢ|y). Extremely fast for text classification. + + + 💻 Python Implementation + from sklearn.naive_bayes import GaussianNB + model = GaussianNB() + model.fit(X, y) + predictions = model.predict(X_test) + + + 🎯 Key Takeaways + + Based on Bayes' Theorem with independence assumption + Variants: Gaussian (continuous), Multinomial (counts), Bernoulli (binary) + Extremely fast, works well with high dimensions + Popular for spam filtering and text classification + + + + + + + ML Algorithm 12 + 🌲 Random Forest + Ensemble of decision trees + + + 📚 What is Random Forest? + Ensemble of decision trees. Each tree trained on random subset (bootstrap) with random features. Final prediction by majority vote.
+ + + 💻 Python Implementation + from sklearn.ensemble import RandomForestClassifier + model = RandomForestClassifier(n_estimators=100, max_depth=10) + model.fit(X, y) + feature_importance = model.feature_importances_ + + + 📊 Interactive Visualization + + + + 🎯 Key Takeaways + + Ensemble of many decision trees (typically 100+) + Reduces overfitting via averaging + Can estimate feature importance + Generally outperforms single decision tree + + + + + + + ML Algorithm 13 + 🚀 Gradient Boosting (XGBoost) + Sequential ensemble method + + + 📚 What is Gradient Boosting? + Sequentially builds trees, each correcting errors of previous. Predictions: F(x) = f₁(x) + f₂(x) + ... + f_n(x). + + + 💻 Python Implementation + from xgboost import XGBClassifier + model = XGBClassifier(n_estimators=100, learning_rate=0.1) + model.fit(X, y) + + + 📊 Interactive Visualization + + + + 🎯 Key Takeaways + + Sequential ensemble: each tree corrects previous errors + State-of-the-art for tabular data + XGBoost, LightGBM, CatBoost = optimized implementations + Has won many Kaggle competitions + + + + + + + ML Algorithm 14 + 🧠 Neural Networks (Deep Learning Basics) + Universal function approximators + + + 📚 What are Neural Networks? + Layers of connected neurons. Each neuron: z = Σwᵢxᵢ + b, then activation function σ(z). Trained via backpropagation + gradient descent. + + + 💻 Python Implementation + from sklearn.neural_network import MLPClassifier + model = MLPClassifier(hidden_layer_sizes=(100, 50), activation='relu') + model.fit(X, y) + + + 📊 Interactive Visualization + + + + 🎯 Key Takeaways + + Universal function approximators + Layers: input → hidden layers → output + Trained via backpropagation and gradient descent + Requires large data, GPU acceleration for deep networks + + + + + + + ML Algorithm 16 + 🌳 Hierarchical Clustering + + + Builds hierarchy of clusters (dendrogram). Agglomerative: merge closest clusters. Divisive: split clusters. No need to specify K upfront.
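The other algorithm pages include a Python snippet, so here is a minimal scikit-learn sketch of agglomerative clustering (the toy data, cluster count, and Ward linkage are illustrative choices):

```python
from sklearn.cluster import AgglomerativeClustering
import numpy as np

# Toy 2-D data: two well-separated groups of three points each
X = np.array([[1.0, 1.0], [1.5, 1.0], [1.0, 1.5],
              [8.0, 8.0], [8.5, 8.0], [8.0, 8.5]])

# Bottom-up (agglomerative) clustering: repeatedly merge the two
# closest clusters until only n_clusters remain
model = AgglomerativeClustering(n_clusters=2, linkage='ward')
labels = model.fit_predict(X)
print(labels)  # the first three points share one label, the last three the other
```

To see the full dendrogram rather than a flat cut at K=2, `scipy.cluster.hierarchy.linkage` plus `dendrogram` is the usual route.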
+ + + 🎯 Key Takeaways + + Creates tree of clusters (dendrogram) + No need to pre-specify K + Linkage methods: single, complete, average, Ward + + + + + + + ML Algorithm 17 + 📍 DBSCAN + + + Density-Based Spatial Clustering. Groups points with many neighbors (dense regions). Can find arbitrarily-shaped clusters and outliers. + + + 🎯 Key Takeaways + + Density-based: finds clusters of arbitrary shape + Parameters: ε (radius), min_samples + Automatically identifies outliers (noise points) + + + + + + + ML Algorithm 18 + 📊 Gaussian Mixture Models (GMM) + + + Soft clustering: each point has probability of belonging to each cluster. Mixture of K Gaussian distributions. Uses EM algorithm. + + + 🎯 Key Takeaways + + Probabilistic clustering with soft assignments + EM algorithm: E-step (responsibilities) → M-step (parameters) + Can model elliptical clusters (not just spherical) + + + + + + + ML Algorithm 19 + 🎯 Principal Component Analysis (PCA) + + + See Data Science Topic 77 for complete details. Reduces dimensions by finding directions of maximum variance. + + + 🎯 Key Takeaways + + Finds orthogonal directions of maximum variance + Standardize features first! + Keeps 80-95% variance with fewer dimensions + + + + + + + ML Algorithm 20 + 🎨 t-SNE + + + t-Distributed Stochastic Neighbor Embedding. Non-linear dimensionality reduction for visualization. Preserves local structure better than PCA. + + + 🎯 Key Takeaways + + Non-linear reduction for visualization (2D/3D) + Preserves local neighborhoods + Slow, not for new data projection (use PCA for that) + + + + + + + ML Algorithm 21 + 🔄 Autoencoders + + + Neural network that learns compressed representation. Encoder: reduces dimensions. Decoder: reconstructs input. Latent space = compressed features. 
+ + + 🎯 Key Takeaways + + Neural network for unsupervised learning + Learns non-linear compression + Used for anomaly detection, denoising, generation + + + + + + + ML Algorithm 22 + 🎮 Q-Learning + + + Reinforcement learning: agent learns optimal actions through trial and error. Q-table stores expected reward for each state-action pair. + + + 🎯 Key Takeaways + + Learns optimal policy through rewards + Q(s,a) = expected future reward + Update rule: Q_new = Q + α[reward + γ max Q_next - Q] + + + + + + + ML Algorithm 23 + 🧠 Deep Q-Networks (DQN) + + + Combines Q-Learning with deep neural networks. Neural net approximates Q-function. Used by DeepMind for Atari games. + + + 🎯 Key Takeaways + + Neural network approximates Q-values + Experience replay for stable training + Achieved superhuman performance in games + + + + + + + ML Algorithm 24 + 🎯 Policy Gradient Methods + + + Directly optimizes policy π(a|s). Gradient ascent on expected reward. REINFORCE algorithm: update based on return. + + + 🎯 Key Takeaways + + Optimizes policy directly (not value function) + Works with continuous action spaces + REINFORCE, Actor-Critic, PPO variants + + + + + + + ML Algorithm 26 + 🔍 GridSearch & RandomSearch + + + GridSearch: Try all combinations of hyperparameters. RandomSearch: Sample random combinations. Both use cross-validation. + + + 🎯 Key Takeaways + + GridSearch: exhaustive, guarantees best in grid + RandomSearch: faster, often finds good solutions + Always use cross-validation for tuning + + + + + + + ML Algorithm 27 + ⚙️ Hyperparameter Tuning + + + Optimizing model settings: learning rate, regularization, tree depth, etc. Methods: Grid search, random search, Bayesian optimization. + + + 🎯 Key Takeaways + + Hyperparameters set before training + Use validation set or CV for tuning + Never tune on test set! + + + + + + + ML Algorithm 28 + 📊 Model Evaluation Metrics + + + Classification: Accuracy, Precision, Recall, F1-Score, ROC-AUC. Regression: MSE, RMSE, MAE, R². 
Choose based on problem. + + + 🎯 Key Takeaways + + Accuracy misleading with imbalanced classes + F1-Score balances precision and recall + ROC-AUC for probability predictions + + + + + + + ML Algorithm 29 + 🎯 Regularization Techniques + + + L1 (Lasso): Sparse. L2 (Ridge): Smooth. Dropout: Random neuron deactivation. Early Stopping: Stop when validation error increases. + + + 🎯 Key Takeaways + + Prevents overfitting by constraining model complexity + L1/L2 for linear models, dropout for neural nets + Early stopping: monitor validation loss + + + + + + + ML Algorithm 30 + ⚖️ Bias-Variance Tradeoff + + + Total Error = Bias² + Variance + Noise. High Bias: Underfitting (too simple). High Variance: Overfitting (too complex). Goal: balance both. + + + 🎯 Key Takeaways + + Bias: error from wrong assumptions (underfitting) + Variance: error from sensitivity to training data (overfitting) + Sweet spot: model complex enough but not too complex + + + + + + + ML Algorithm 31 + 🎭 Ensemble Methods + + + Bagging: Parallel models, average predictions (Random Forest). Boosting: Sequential, correct errors (XGBoost). Stacking: Meta-model combines base models. + + + 🎯 Key Takeaways + + Combine multiple models for better performance + Bagging reduces variance, Boosting reduces bias + Often wins Kaggle competitions + + + + + + + ML Algorithm 32 + 🔧 Feature Engineering + + + Creating new features from existing ones. Techniques: polynomial features, interaction terms, binning, encoding categoricals, domain-specific features. + + + 🎯 Key Takeaways + + Often more important than algorithm choice + Domain knowledge crucial + Techniques: scaling, encoding, transformations, interactions + + + + + + + ML Algorithm 33 + ⚖️ Handling Imbalanced Data + + + When one class dominates: SMOTE (synthetic minority oversampling), undersampling, class weights, or use metrics like F1/ROC-AUC instead of accuracy. 
+ + + 🎯 Key Takeaways + + Accuracy misleading with imbalanced classes + SMOTE: create synthetic minority examples + Class weights: penalize minority errors more + + + + + + + ML Algorithm 34 + 📈 Time Series Analysis + + + Sequential data with temporal dependency. Models: ARIMA, LSTM, Prophet. Key: train/test split must respect time order (no shuffling!). + + + 🎯 Key Takeaways + + Temporal structure matters - no random splitting + ARIMA for linear, LSTM for non-linear patterns + Handle seasonality, trend, autocorrelation + + + + + + + ML Algorithm 35 + 🚨 Anomaly Detection + + + Finding rare, unusual observations. Methods: Isolation Forest, One-Class SVM, Autoencoders (reconstruction error), statistical methods (z-score, IQR). + + + 🎯 Key Takeaways + + Identifies outliers/anomalies in data + Isolation Forest: isolates anomalies faster + Applications: fraud detection, quality control + + + + + + + ML Algorithm 36 + 🔄 Transfer Learning + + + Use pre-trained model on new task. Take model trained on ImageNet, adapt to your problem. Faster training, needs less data. + + + 🎯 Key Takeaways + + Leverage knowledge from source task + Common in computer vision (ImageNet models) + Freeze early layers, train final layers + + + + + + + ML Algorithm 37 + 🎯 Fine-Tuning Pre-trained Models + + + Start with pre-trained weights, continue training on new data. Lower learning rate, selectively unfreeze layers. Balances speed and customization. + + + 🎯 Key Takeaways + + Adapt pre-trained model to specific task + Use lower learning rate to avoid catastrophic forgetting + Unfreeze layers gradually from top + + + + + + + ML Algorithm 38 + 🔍 Model Interpretability & SHAP + + + SHAP: SHapley Additive exPlanations. Assigns each feature an importance value for prediction. Based on game theory (Shapley values). 
+ + + 🎯 Key Takeaways + + Explains individual predictions + SHAP values show feature contributions + LIME: Local Interpretable Model-agnostic Explanations + + + + + + + ML Algorithm 39 + ⚡ Optimization Algorithms (Adam, RMSprop) + + + Adam: Adaptive learning rates per parameter + momentum. Most popular optimizer. RMSprop: Divides by moving average of gradient squared. + + + 🎯 Key Takeaways + + Adam: adaptive + momentum, good default choice + RMSprop: adaptive learning rates + SGD+Momentum: simple but effective + + + + + + + ML Algorithm 40 + 🎯 Batch Normalization & Dropout + + + Batch Norm: Normalizes layer inputs, stabilizes training. Dropout: Randomly drops neurons during training (p=0.5 typical), prevents overfitting. + + + 🎯 Key Takeaways + + Batch Norm: faster training, less sensitive to initialization + Dropout: regularization for neural networks + Both critical for deep learning success + + + +
Predicting continuous values with a straight line
Linear regression is the simplest supervised learning algorithm that models the relationship between input features and a continuous output variable using a straight line (in 2D) or hyperplane (in higher dimensions).
Analogy: Like drawing the best-fit line through scattered points on a graph to predict future values based on the trend.
Step-by-step intuition:
β₀ = intercept, β₁ = slope, ε = error
A real estate company has data on 5 houses. Predict the price of a 2500 sq ft house.
Calculate Means
x̄ = (1000 + 1500 + 2000 + 3000) / 4 = 1875 sq ft
ȳ = (150 + 200 + 250 + 350) / 4 = 237.5 ($1000s)
We exclude the house we're predicting from training
Calculate Deviations
(x - x̄): -875, -375, 125, 1125
(y - ȳ): -87.5, -37.5, 12.5, 112.5
Find how much each point differs from the mean
Calculate Slope (β₁)
Numerator: (-875)(-87.5) + (-375)(-37.5) + (125)(12.5) + (1125)(112.5)
= 76562.5 + 14062.5 + 1562.5 + 126562.5 = 218750
Denominator: (-875)² + (-375)² + (125)² + (1125)²
= 765625 + 140625 + 15625 + 1265625 = 2187500
β₁ = 218750 / 2187500 = 0.10
Slope tells us price change per sq ft
Calculate Intercept (β₀)
β₀ = ȳ - β₁ × x̄
β₀ = 237.5 - 0.10 × 1875
β₀ = 237.5 - 187.5 = 50
Base price when size = 0
Write Prediction Equation
Price = 50 + 0.10 × Size
For 2500 sq ft:
Price = 50 + 0.10 × 2500 = 50 + 250 = 300
$300,000 predicted price
Calculate R² Score
Predictions: 150, 200, 250, 350
Residuals: 0, 0, 0, 0 (perfect fit!)
R² = 1 - (SS_res / SS_tot) = 1.0
R² = 1.0 means perfect linear fit
The model fits perfectly (R²=1.0). Each additional sq ft adds $100 to the price. The $50k base price represents fixed costs.
Answers:
from sklearn.linear_model import LinearRegression
import numpy as np

# Training data
X = np.array([[1000], [1500], [2000], [3000]])
y = np.array([150, 200, 250, 350])

# Create and train model
model = LinearRegression()
model.fit(X, y)

# Make prediction
prediction = model.predict([[2500]])
print(f"Predicted price: ${prediction[0]:.0f}k")

# Model parameters
print(f"Slope: {model.coef_[0]:.3f}")
print(f"Intercept: {model.intercept_:.2f}")
Classification by majority vote of nearest neighbors
K-Nearest Neighbors is a simple, non-parametric algorithm that classifies data points based on how their neighbors are classified. It finds the K closest training examples and uses majority vote.
Analogy: "You are the average of the 5 people you spend the most time with." KNN says "You're similar to your closest neighbors in feature space!"
Most common distance metric for KNN
Alternative: sum of absolute differences
Most frequent class among K neighbors
Classify a new iris flower with sepal length=5.0cm, sepal width=3.5cm. Use K=3.
Define New Point
New flower: x_new = [5.0, 3.5]
K = 3 (we'll find 3 nearest neighbors)
The flower we want to classify
Calculate Distances to All Points
d₁ = √[(5.0-5.1)² + (3.5-3.5)²] = √[0.01 + 0] = 0.10
d₂ = √[(5.0-4.9)² + (3.5-3.0)²] = √[0.01 + 0.25] = 0.51
d₃ = √[(5.0-7.0)² + (3.5-3.2)²] = √[4.0 + 0.09] = 2.02
d₄ = √[(5.0-6.4)² + (3.5-3.2)²] = √[1.96 + 0.09] = 1.43
d₅ = √[(5.0-5.0)² + (3.5-3.6)²] = √[0 + 0.01] = 0.10
Euclidean distance to each training point
Sort by Distance
Select top 3 (highlighted) for K=3
Take Majority Vote
3 nearest neighbors:
Neighbor 1: Setosa (distance 0.10)
Neighbor 2: Setosa (distance 0.10)
Neighbor 3: Setosa (distance 0.51)
Vote count: Setosa = 3, Versicolor = 0
Winner: Setosa (unanimous!)
Majority class wins
Make Prediction
Predicted Class: Setosa
Confidence: 3/3 = 100%
All neighbors agree
The new flower is extremely close to known Setosa examples (distances 0.10, 0.10, 0.51). The unanimous vote gives us high confidence in this classification.
from sklearn.neighbors import KNeighborsClassifier
import numpy as np

# Training data
X = np.array([[5.1, 3.5], [4.9, 3.0], [7.0, 3.2], [6.4, 3.2], [5.0, 3.6]])
y = np.array(['Setosa', 'Setosa', 'Versicolor', 'Versicolor', 'Setosa'])

# Create and train model
model = KNeighborsClassifier(n_neighbors=3)
model.fit(X, y)

# Make prediction
new_flower = np.array([[5.0, 3.5]])
prediction = model.predict(new_flower)
proba = model.predict_proba(new_flower)
print(f"Predicted: {prediction[0]}")
print(f"Confidence: {proba[0].max():.2%}")
Tree-based decisions using feature splits
Decision Trees make predictions by asking a series of yes/no questions about features, creating a flowchart-like structure from root to leaves.
Analogy: Like a game of 20 Questions - each question (split) narrows down possibilities until you reach a final decision (leaf).
pᵢ = proportion of class i. Measures disorder.
Choose split with highest information gain
Used by CART algorithm. Faster to compute.
Build decision tree for loan approval. Dataset:
Calculate Root Entropy
Total: 6 samples
Approved (Yes): 3/6 = 0.5
Denied (No): 3/6 = 0.5
H(root) = -[0.5 log₂(0.5) + 0.5 log₂(0.5)]
H(root) = -[0.5(-1) + 0.5(-1)] = 1.0
Maximum entropy = maximum disorder
Test Split on Credit Score
If Credit = Good: 2 Yes, 0 No → H = 0 (pure!)
If Credit = Poor: 1 Yes, 3 No → H = -[0.25log₂(0.25) + 0.75log₂(0.75)]
H(Poor) = -[0.25(-2) + 0.75(-0.415)] = 0.5 + 0.311 = 0.811
Weighted avg: (2/6)×0 + (4/6)×0.811 = 0.541
IG(Credit) = 1.0 - 0.541 = 0.459
Information gain from splitting on Credit Score
Test Split on Income
If Income = High: 2 Yes, 1 No → H = 0.918
If Income = Low: 1 Yes, 2 No → H = 0.918
Weighted: (3/6)×0.918 + (3/6)×0.918 = 0.918
IG(Income) = 1.0 - 0.918 = 0.082
Income provides less information gain
Choose Best Split
IG(Credit Score) = 0.459 ← HIGHEST!
IG(Income) = 0.082
Best first split: Credit Score
Choose feature with highest information gain
Build Tree Recursively
Root: Credit Score = Good?
├─ YES → Approved (pure node)
└─ NO → Split on Income
├─ Income = High? → Denied
└─ Income = Low? → Denied (majority)
Continue splitting until pure or stopping criterion
Make Predictions
New applicant: Credit=Good, Income=High
Follow path: Credit=Good → Approved ✓
Decision rule: IF Credit Score is Good THEN Approve
Traverse tree from root to leaf
The tree correctly classifies all training examples. Credit Score is the most important feature with IG=0.459.
from sklearn.tree import DecisionTreeClassifier
from sklearn import tree
import matplotlib.pyplot as plt

# Create and train
model = DecisionTreeClassifier(max_depth=3, criterion='entropy')
model.fit(X_train, y_train)

# Predict
predictions = model.predict(X_test)

# Visualize tree
tree.plot_tree(model, filled=True, feature_names=['Income', 'Credit', 'Age'])
Partitioning data into K distinct clusters
K-Means is an unsupervised learning algorithm that groups similar data points into K clusters by minimizing within-cluster variance.
Analogy: Organizing a messy room by grouping similar items together. K-Means finds natural groupings in unlabeled data.
Sum of squared distances from points to centroids
Mean of all points assigned to cluster k
Assign to nearest centroid
Cluster 6 customers into K=2 groups based on [Age, Income]. Data:
Initialize K=2 Random Centroids
C₁ (initial) = [25, 40] (customer A)
C₂ (initial) = [60, 90] (customer E)
Start with random points or use K-means++
Assign Points to Nearest Centroid
Distance from A to C₁: √[(25-25)² + (40-40)²] = 0
Distance from A to C₂: √[(25-60)² + (40-90)²] = √[1225+2500] = 61.0
A → Cluster 1 (closer to C₁)
Similarly calculate for all:
B [30,50] → C₁ (dist=11.2 vs 50.0)
C [28,45] → C₁ (dist=5.8 vs 55.2)
D [55,80] → C₂ (dist=50.0 vs 11.2)
E [60,90] → C₂ (dist=0)
F [52,75] → C₂ (dist=44.2 vs 17.0)
Cluster 1: {A, B, C}
Cluster 2: {D, E, F}
Each point goes to its nearest centroid
Recalculate Centroids
New C₁ = mean of {A, B, C}
Age: (25 + 30 + 28)/3 = 27.67
Income: (40 + 50 + 45)/3 = 45
C₁ = [27.67, 45]
New C₂ = mean of {D, E, F}
Age: (55 + 60 + 52)/3 = 55.67
Income: (80 + 90 + 75)/3 = 81.67
C₂ = [55.67, 81.67]
Centroids move to center of their clusters
Check Convergence
Re-assign with new centroids:
All points stay in same clusters!
Centroids don't change → CONVERGED ✓
Algorithm stops when assignments don't change
Calculate Within-Cluster Sum of Squares
WCSS₁ = Σ dist² to C₁ = 0² + 11.2² + 5.8² ≈ 159.1
WCSS₂ = Σ dist² to C₂ = 11.2² + 0² + 17.0² ≈ 414.4
Total WCSS ≈ 573.5
Measures cluster compactness (lower = better)
Algorithm converged in 1 iteration. Clear separation: younger customers with lower income vs older customers with higher income.
from sklearn.cluster import KMeans
import numpy as np

# Create model
kmeans = KMeans(n_clusters=2, random_state=42)
kmeans.fit(X)

# Get predictions
labels = kmeans.labels_
centroids = kmeans.cluster_centers_

# Predict for new point
new_customer = np.array([[32, 55]])
cluster = kmeans.predict(new_customer)
print(f"Assigned to cluster: {cluster[0]}")
Reliable model evaluation technique
Cross-validation is a resampling technique that evaluates model performance by training and testing on different subsets of data multiple times.
Analogy: Testing a student on multiple different exams instead of just one - gives more reliable assessment of their true knowledge.
Average performance across K folds
σ = standard deviation of K scores
Evaluate a model using 5-fold CV. Dataset has 100 samples. After running, fold accuracies are: 0.85, 0.90, 0.88, 0.87, 0.90. Calculate mean accuracy and standard error.
Understand the Setup
Total samples: n = 100
Number of folds: K = 5
Each fold size: 100/5 = 20 samples
Each iteration: Train on 80, Test on 20
Divide data into 5 equal parts
Record Fold Results
Performance on each test fold
Calculate Mean Accuracy
Mean = (0.85 + 0.90 + 0.88 + 0.87 + 0.90) / 5
Mean = 4.40 / 5 = 0.88
Average accuracy: 88%
This is our best estimate of model performance
Calculate Standard Deviation
Deviations: (0.85-0.88), (0.90-0.88), (0.88-0.88), (0.87-0.88), (0.90-0.88)
= -0.03, 0.02, 0, -0.01, 0.02
Squared: 0.0009, 0.0004, 0, 0.0001, 0.0004
Variance = 0.0018 / 4 = 0.00045
SD = √0.00045 = 0.021
Measures variability across folds
Calculate Standard Error
SE = SD / √K = 0.021 / √5
SE = 0.021 / 2.236 = 0.0094
SE ≈ 0.0094 or 0.94%
Precision of our mean estimate
Report Results with Confidence
Mean accuracy: 0.88 ± 0.009
95% CI (approx): 0.88 ± 2×0.009 = [0.862, 0.898]
Model performs between 86.2% and 89.8% with 95% confidence
Final performance estimate with uncertainty
Low variability (SD=0.021) indicates stable model performance. Every test fold performed similarly, suggesting the model generalizes well.
from sklearn.model_selection import cross_val_score
from sklearn.tree import DecisionTreeClassifier
import numpy as np

model = DecisionTreeClassifier()

# 5-fold cross-validation
scores = cross_val_score(model, X, y, cv=5, scoring='accuracy')
print(f"Fold scores: {scores}")
print(f"Mean: {scores.mean():.3f}")
print(f"Std: {scores.std(ddof=1):.3f}")

# Approximate 95% CI from the standard error of the mean (as in the steps above)
se = scores.std(ddof=1) / np.sqrt(len(scores))
print(f"95% CI: [{scores.mean() - 2*se:.3f}, {scores.mean() + 2*se:.3f}]")
Fitting non-linear relationships with polynomial curves
Polynomial regression extends linear regression by adding polynomial terms (x², x³, etc.) to capture non-linear, curved relationships in data.
Analogy: When a straight line won't fit your data (like trajectory of a thrown ball), use a curved line instead!
Temperature (°C): [10, 15, 20, 25, 30]. Sales ($100s): [2, 5, 12, 22, 35]. Fit quadratic model and predict sales at 27°C.
Set Up Polynomial Model
y = β₀ + β₁x + β₂x²
Where x = temperature, y = sales
Need to find β₀, β₁, β₂
Create Design Matrix
x | x² | y
10 | 100 | 2
15 | 225 | 5
20 | 400 | 12
25 | 625 | 22
30 | 900 | 35
Solve Using Normal Equations (simplified)
Using least squares: β = (XᵀX)⁻¹Xᵀy
Result: β₀ = 5.0, β₁ ≈ -0.969, β₂ ≈ 0.0657
Write Equation
y = 5.0 - 0.969x + 0.0657x²
Predict at x = 27°C
y = 5.0 - 0.969(27) + 0.0657(27)²
y = 5.0 - 26.16 + 0.0657(729)
y = 5.0 - 26.16 + 47.90
y ≈ 26.7 → about $2,670 in sales (y is in $100s)
from sklearn.preprocessing import PolynomialFeatures
from sklearn.linear_model import LinearRegression
import numpy as np

X = np.array([10, 15, 20, 25, 30]).reshape(-1, 1)
y = np.array([2, 5, 12, 22, 35])

# Create polynomial features (degree 2)
poly = PolynomialFeatures(degree=2)
X_poly = poly.fit_transform(X)

# Fit model
model = LinearRegression()
model.fit(X_poly, y)

# Predict (y is in units of $100s)
X_new = poly.transform([[27]])
print(f"Sales at 27°C: {model.predict(X_new)[0]:.1f} ($100s)")
Preventing overfitting with L2 penalty
Ridge regression adds an L2 penalty term to the loss function, shrinking coefficient magnitudes to prevent overfitting.
Formula: J = MSE + α Σβᵢ²
Compare linear vs ridge regression. Data prone to overfitting. α = 0.1
Linear Regression Cost
J = (1/n)Σ(y - ŷ)²
Ridge Cost Function
J_ridge = (1/n)Σ(y - ŷ)² + α Σβᵢ²
Penalty term shrinks large coefficients
from sklearn.linear_model import Ridge

model = Ridge(alpha=0.1)
model.fit(X_train, y_train)
predictions = model.predict(X_test)
Feature selection through L1 penalty
Lasso adds L1 penalty: J = MSE + α Σ|βᵢ|. Can shrink coefficients to exactly zero, performing automatic feature selection.
5 features, but only 2 are relevant. Use Lasso with α = 0.5
Linear Regression (No Penalty)
All coefficients non-zero: [3.2, 0.5, 5.1, 0.3, 0.1]
Apply Lasso Penalty
J = MSE + 0.5 Σ|βᵢ|
Coefficients that add little predictive value are driven to exactly zero
Lasso Result
Coefficients: [3.1, 0, 5.0, 0, 0]
Features 2, 4, 5 eliminated!
from sklearn.linear_model import Lasso
import numpy as np

model = Lasso(alpha=0.5)
model.fit(X_train, y_train)
print(f"Non-zero features: {np.sum(model.coef_ != 0)}")
Combining L1 and L2 penalties
Combines L1 and L2: J = MSE + α₁Σ|βᵢ| + α₂Σβᵢ². Best of both Ridge and Lasso.
from sklearn.linear_model import ElasticNet

model = ElasticNet(alpha=0.1, l1_ratio=0.5)
model.fit(X, y)
Robust regression with margin tolerance
SVR finds hyperplane that fits data within margin ε. Points outside margin contribute to loss. Robust to outliers.
from sklearn.svm import SVR

model = SVR(kernel='rbf', C=1.0, epsilon=0.1)
model.fit(X, y)
Binary classification with sigmoid function
Binary classification using sigmoid: P(y=1) = 1/(1+e^(-z)) where z = β₀+β₁x. Despite name, it's for classification!
from sklearn.linear_model import LogisticRegression

model = LogisticRegression()
model.fit(X, y)
proba = model.predict_proba(X_new)
Maximum margin classification
SVM finds hyperplane that maximally separates classes. Uses support vectors (closest points) and kernel trick for non-linear boundaries.
from sklearn.svm import SVC

model = SVC(kernel='rbf', C=1.0, gamma='auto')
model.fit(X, y)
Probabilistic classifier using Bayes' Theorem
Applies Bayes' Theorem with "naive" independence assumption. P(y|x) ∝ P(y)ΠP(xᵢ|y). Extremely fast for text classification.
from sklearn.naive_bayes import GaussianNB

model = GaussianNB()
model.fit(X, y)
predictions = model.predict(X_test)
Ensemble of decision trees
Ensemble of decision trees. Each tree trained on random subset (bootstrap) with random features. Final prediction by majority vote.
from sklearn.ensemble import RandomForestClassifier

model = RandomForestClassifier(n_estimators=100, max_depth=10)
model.fit(X, y)
feature_importance = model.feature_importances_
Sequential ensemble method
Sequentially builds trees, each correcting errors of previous. Predictions: F(x) = f₁(x) + f₂(x) + ... + f_n(x).
from xgboost import XGBClassifier

model = XGBClassifier(n_estimators=100, learning_rate=0.1)
model.fit(X, y)
Universal function approximators
Layers of connected neurons. Each neuron: z = Σwᵢxᵢ + b, then activation function σ(z). Trained via backpropagation + gradient descent.
from sklearn.neural_network import MLPClassifier

model = MLPClassifier(hidden_layer_sizes=(100, 50), activation='relu')
model.fit(X, y)
Builds hierarchy of clusters (dendrogram). Agglomerative: merge closest clusters. Divisive: split clusters. No need to specify K upfront.
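A minimal sketch of agglomerative clustering with scikit-learn (toy data and the Ward linkage choice are illustrative):

```python
import numpy as np
from sklearn.cluster import AgglomerativeClustering

# Two tight pairs of points, far apart from each other
X = np.array([[1.0, 1.0], [1.2, 1.1], [5.0, 5.0], [5.1, 4.9]])

# Agglomerative = bottom-up: repeatedly merge the closest clusters
model = AgglomerativeClustering(n_clusters=2, linkage="ward")
labels = model.fit_predict(X)
print(labels)  # the two pairs land in different clusters
```

To inspect the dendrogram itself, `scipy.cluster.hierarchy.linkage` plus `dendrogram` is the usual route.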
Density-Based Spatial Clustering. Groups points with many neighbors (dense regions). Can find arbitrarily-shaped clusters and outliers.
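A short sketch showing the two DBSCAN parameters and noise labeling (the data is a made-up dense blob plus one outlier):

```python
import numpy as np
from sklearn.cluster import DBSCAN

# A dense blob of four points plus one far-away outlier
X = np.array([[0, 0], [0.1, 0], [0, 0.1], [0.1, 0.1], [10, 10]])

# eps = neighborhood radius, min_samples = points needed for a dense region
db = DBSCAN(eps=0.5, min_samples=3).fit(X)
print(db.labels_)  # outlier is labeled -1 (noise)
```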
Soft clustering: each point has probability of belonging to each cluster. Mixture of K Gaussian distributions. Uses EM algorithm.
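A sketch of the soft assignments GMM produces, on synthetic data from two well-separated Gaussians (all values illustrative):

```python
import numpy as np
from sklearn.mixture import GaussianMixture

rng = np.random.default_rng(0)
X = np.vstack([rng.normal(0, 1, (50, 2)),   # cluster around (0, 0)
               rng.normal(8, 1, (50, 2))])  # cluster around (8, 8)

gmm = GaussianMixture(n_components=2, random_state=0).fit(X)  # EM under the hood
proba = gmm.predict_proba(X[:1])  # soft assignment: one probability per component
print(proba)  # rows sum to 1
```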
See Data Science Topic 77 for complete details. Reduces dimensions by finding directions of maximum variance.
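A minimal sketch of the standardize-then-reduce workflow (synthetic data; the deliberately redundant column shows PCA dropping a dimension):

```python
import numpy as np
from sklearn.decomposition import PCA
from sklearn.preprocessing import StandardScaler

rng = np.random.default_rng(0)
X = rng.normal(size=(100, 5))
X[:, 1] = 2 * X[:, 0]  # column 1 is perfectly redundant with column 0

X_std = StandardScaler().fit_transform(X)  # standardize features first!
pca = PCA(n_components=0.95)               # keep 95% of the variance
X_reduced = pca.fit_transform(X_std)
print(X_reduced.shape)  # fewer than 5 columns survive
```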
t-Distributed Stochastic Neighbor Embedding. Non-linear dimensionality reduction for visualization. Preserves local structure better than PCA.
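A minimal usage sketch: t-SNE maps high-dimensional points into 2D for plotting (random data here; `perplexity` is an illustrative choice):

```python
import numpy as np
from sklearn.manifold import TSNE

rng = np.random.default_rng(0)
X = rng.normal(size=(60, 10))  # 60 points in 10 dimensions

# Embed into 2D for visualization; perplexity ~ effective neighborhood size
emb = TSNE(n_components=2, perplexity=10, random_state=0).fit_transform(X)
print(emb.shape)  # (60, 2)
```

Note there is no `transform` for new points, which is why PCA is preferred when new data must be projected.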
Neural network that learns compressed representation. Encoder: reduces dimensions. Decoder: reconstructs input. Latent space = compressed features.
Reinforcement learning: agent learns optimal actions through trial and error. Q-table stores expected reward for each state-action pair.
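The Q-table update rule can be sketched in a few lines (the 2-state environment and the observed transition are made up for illustration):

```python
import numpy as np

n_states, n_actions = 2, 2
Q = np.zeros((n_states, n_actions))  # Q-table: expected future reward
alpha, gamma = 0.5, 0.9              # learning rate, discount factor

# One observed transition: in state 0, action 1 gave reward 1, moved to state 1
s, a, r, s_next = 0, 1, 1.0, 1

# Q_new = Q + alpha * [reward + gamma * max Q_next - Q]
Q[s, a] += alpha * (r + gamma * Q[s_next].max() - Q[s, a])
print(Q[0, 1])  # 0.5 after this single update
```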
Combines Q-Learning with deep neural networks. Neural net approximates Q-function. Used by DeepMind for Atari games.
Directly optimizes policy π(a|s). Gradient ascent on expected reward. REINFORCE algorithm: update based on return.
GridSearch: Try all combinations of hyperparameters. RandomSearch: Sample random combinations. Both use cross-validation.
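A minimal GridSearchCV sketch on a built-in dataset (the parameter grid is an illustrative choice):

```python
from sklearn.datasets import load_iris
from sklearn.model_selection import GridSearchCV
from sklearn.tree import DecisionTreeClassifier

X, y = load_iris(return_X_y=True)

# Try every max_depth in the grid, scored by 5-fold cross-validation
grid = GridSearchCV(DecisionTreeClassifier(random_state=0),
                    param_grid={"max_depth": [1, 2, 3]}, cv=5)
grid.fit(X, y)
print(grid.best_params_, grid.best_score_)
```

`RandomizedSearchCV` has the same interface but samples `n_iter` random combinations instead of trying them all.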
Optimizing model settings: learning rate, regularization, tree depth, etc. Methods: Grid search, random search, Bayesian optimization.
Classification: Accuracy, Precision, Recall, F1-Score, ROC-AUC. Regression: MSE, RMSE, MAE, R². Choose based on problem.
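The classification metrics on a small hand-made prediction vector (2 true positives, 1 false positive, 1 false negative):

```python
from sklearn.metrics import accuracy_score, precision_score, recall_score, f1_score

y_true = [1, 1, 1, 0, 0, 0, 0, 0]
y_pred = [1, 1, 0, 0, 0, 0, 0, 1]

print(accuracy_score(y_true, y_pred))   # (TP+TN)/total = 6/8 = 0.75
print(precision_score(y_true, y_pred))  # TP/(TP+FP) = 2/3
print(recall_score(y_true, y_pred))     # TP/(TP+FN) = 2/3
print(f1_score(y_true, y_pred))         # harmonic mean = 2/3
```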
L1 (Lasso): Sparse. L2 (Ridge): Smooth. Dropout: Random neuron deactivation. Early Stopping: Stop when validation error increases.
Total Error = Bias² + Variance + Noise. High Bias: Underfitting (too simple). High Variance: Overfitting (too complex). Goal: balance both.
Bagging: Parallel models, average predictions (Random Forest). Boosting: Sequential, correct errors (XGBoost). Stacking: Meta-model combines base models.
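A sketch of combining heterogeneous base models by majority vote (the choice of the three base models is illustrative):

```python
from sklearn.datasets import load_iris
from sklearn.ensemble import VotingClassifier
from sklearn.linear_model import LogisticRegression
from sklearn.naive_bayes import GaussianNB
from sklearn.tree import DecisionTreeClassifier

X, y = load_iris(return_X_y=True)

# Hard voting: each model casts one vote, majority class wins
vote = VotingClassifier([
    ("lr", LogisticRegression(max_iter=1000)),
    ("nb", GaussianNB()),
    ("dt", DecisionTreeClassifier(random_state=0)),
])
vote.fit(X, y)
print(vote.score(X, y))
```

`StackingClassifier` goes one step further, training a meta-model on the base models' outputs.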
Creating new features from existing ones. Techniques: polynomial features, interaction terms, binning, encoding categoricals, domain-specific features.
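One of the listed techniques sketched concretely: polynomial and interaction features from two raw columns (the input row is made up):

```python
import numpy as np
from sklearn.preprocessing import PolynomialFeatures

X = np.array([[2, 3]])  # one sample with features x1=2, x2=3

# degree=2 adds squares and the pairwise interaction term
poly = PolynomialFeatures(degree=2, include_bias=False)
feats = poly.fit_transform(X)
print(feats)  # [[2, 3, 4, 6, 9]] = x1, x2, x1², x1·x2, x2²
```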
When one class dominates: SMOTE (synthetic minority oversampling), undersampling, class weights, or use metrics like F1/ROC-AUC instead of accuracy.
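A sketch of the class-weight approach on a made-up 90/10 label distribution:

```python
import numpy as np
from sklearn.linear_model import LogisticRegression
from sklearn.utils.class_weight import compute_class_weight

y = np.array([0] * 90 + [1] * 10)  # 90/10 imbalance

# "balanced" weights each class by n_samples / (n_classes * class_count)
w = compute_class_weight("balanced", classes=np.array([0, 1]), y=y)
print(w)  # minority class weighted 9x heavier

# Most sklearn classifiers accept the same option directly:
model = LogisticRegression(class_weight="balanced")
```

SMOTE lives in the separate `imbalanced-learn` package rather than scikit-learn itself.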
Sequential data with temporal dependency. Models: ARIMA, LSTM, Prophet. Key: train/test split must respect time order (no shuffling!).
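The time-ordered splitting rule can be sketched with `TimeSeriesSplit` (the index data is a stand-in for any time-ordered series):

```python
import numpy as np
from sklearn.model_selection import TimeSeriesSplit

X = np.arange(10).reshape(-1, 1)  # 10 time-ordered observations

tscv = TimeSeriesSplit(n_splits=3)
for train_idx, test_idx in tscv.split(X):
    # every training index precedes every test index - no shuffling
    print(train_idx, test_idx)
```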
Finding rare, unusual observations. Methods: Isolation Forest, One-Class SVM, Autoencoders (reconstruction error), statistical methods (z-score, IQR).
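A minimal Isolation Forest sketch with one planted outlier (data and `contamination` value are illustrative):

```python
import numpy as np
from sklearn.ensemble import IsolationForest

rng = np.random.default_rng(0)
X = np.vstack([rng.normal(0, 1, (100, 2)),  # normal points around the origin
               [[8, 8]]])                   # one planted outlier

# contamination = expected fraction of anomalies
iso = IsolationForest(contamination=0.01, random_state=0).fit(X)
labels = iso.predict(X)  # -1 = anomaly, 1 = normal
print(labels[-1])        # the planted outlier is flagged
```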
Use pre-trained model on new task. Take model trained on ImageNet, adapt to your problem. Faster training, needs less data.
Start with pre-trained weights, continue training on new data. Lower learning rate, selectively unfreeze layers. Balances speed and customization.
SHAP: SHapley Additive exPlanations. Assigns each feature an importance value for prediction. Based on game theory (Shapley values).
Adam: Adaptive learning rates per parameter + momentum. Most popular optimizer. RMSprop: Divides by moving average of gradient squared.
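A hand-rolled single Adam step on one scalar parameter, just to make the two moment estimates concrete (not a library API; all values illustrative):

```python
import numpy as np

theta, m, v, t = 0.0, 0.0, 0.0, 1              # parameter, moments, step count
lr, b1, b2, eps = 0.001, 0.9, 0.999, 1e-8      # standard Adam defaults
g = 2.0                                        # gradient of the loss at theta

m = b1 * m + (1 - b1) * g                      # 1st moment (momentum)
v = b2 * v + (1 - b2) * g ** 2                 # 2nd moment (RMSprop-style average)
m_hat = m / (1 - b1 ** t)                      # bias correction for zero init
v_hat = v / (1 - b2 ** t)
theta -= lr * m_hat / (np.sqrt(v_hat) + eps)   # adaptive, per-parameter step
print(theta)  # first step size is about lr, regardless of gradient scale
```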
Batch Norm: Normalizes layer inputs, stabilizes training. Dropout: Randomly drops neurons during training (p=0.5 typical), prevents overfitting.