AIOmarRehan committed on
Commit 0bdf4e4 · verified · 1 Parent(s): f46707a

Update README.md

Files changed (1): README.md (+363 −3)
---
title: Brain Tumor Classification With InceptionV3-Grad-CAM
emoji: 🧠
colorFrom: pink
colorTo: red
sdk: gradio
sdk_version: 6.0.0
app_file: app.py
pinned: false
license: mit
short_description: InceptionV3-based brain tumor detection with Grad-CAM.
---
# **Brain Tumor Classification Using InceptionV3 and Grad-CAM**

A complete deep learning pipeline for **brain tumor classification** using MRI scans.
This project demonstrates:

* **End-to-end data preprocessing**
* **Augmentation & dataset balancing**
* **Efficient tf.data pipelines**
* **Transfer learning with InceptionV3**
* **Deep model evaluation**
* **Grad-CAM interpretability**
* **LaTeX mathematical explanations**

---
## **1. Dataset Exploration & Inspection**

We begin by recursively scanning all MRI images and creating a structured DataFrame:

```python
from pathlib import Path
import pandas as pd

image_extensions = {'.jpg', '.jpeg', '.png'}
paths = [
    (path.parts[-2], path.name, str(path))
    for path in Path("/content/my_data").rglob('*.*')
    if path.suffix.lower() in image_extensions
]

df = pd.DataFrame(paths, columns=['class', 'image', 'full_path'])
df = df.sort_values('class').reset_index(drop=True)
df.head()
```

Count images per class:

```python
class_count = df['class'].value_counts()
print(class_count)
```

### **Visualizations**

```python
import matplotlib.pyplot as plt

plt.figure(figsize=(32, 16))
class_count.plot(kind='bar', edgecolor='black')
plt.title('Number of Images per Class')
plt.show()
```

### **Insights**

* Classes are **imbalanced**
* Images have **variable resolution**
* Some outliers require **cleaning**

---
## **2. Data Cleaning & Quality Checks**

### **Duplicate removal using MD5 hashes**

```python
import hashlib

def get_hash(file_path):
    with open(file_path, 'rb') as f:
        return hashlib.md5(f.read()).hexdigest()

df['file_hash'] = df['full_path'].apply(get_hash)
df_unique = df.drop_duplicates(subset='file_hash', keep='first')
```

### **Additional checks**

* Corrupted image detection
* Resolution anomalies
* Brightness/contrast outliers

Cleaning ensures a **robust dataset** with minimal noise.
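The corrupted-image check can be sketched with Pillow's `verify()`, which parses file headers without a full decode. A minimal illustration; the `find_corrupted` helper name is ours, not from the notebook:

```python
from PIL import Image

def find_corrupted(paths):
    """Return the subset of paths that PIL cannot parse as images."""
    bad = []
    for p in paths:
        try:
            with Image.open(p) as im:
                im.verify()  # parses headers; raises on truncated/invalid files
        except Exception:
            bad.append(p)
    return bad

# the flagged rows can then be dropped, e.g.:
# df = df[~df['full_path'].isin(find_corrupted(df['full_path']))]
```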
---

## **3. Data Augmentation & Class Balancing**

Target ~200 images per class using heavy augmentation:

```python
from tensorflow.keras.preprocessing.image import ImageDataGenerator

datagen = ImageDataGenerator(
    rotation_range=20,
    width_shift_range=0.1,
    height_shift_range=0.1,
    shear_range=0.1,
    zoom_range=0.1,
    horizontal_flip=True,
    fill_mode='nearest'
)
```

Used for minority-class upsampling and preventing overfitting.
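The balancing step itself is simple bookkeeping: compute each class's deficit relative to the ~200-image target, then draw that many augmented samples from `datagen.flow`. A sketch of the deficit calculation (the `augmentation_plan` helper and the example counts are illustrative, not from the notebook):

```python
def augmentation_plan(class_counts, target=200):
    """Map each under-represented class to the number of augmented images needed."""
    return {cls: target - n for cls, n in class_counts.items() if n < target}

# illustrative counts; in the notebook these come from df['class'].value_counts()
plan = augmentation_plan({'glioma': 120, 'meningioma': 200, 'no_tumor': 85})
# each deficit is then filled by drawing batches from datagen.flow(...)
```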
---

## **4. Image Preprocessing Pipeline**

```python
import tensorflow as tf

def preprocess_image(path, target_size=(512, 512), augment=True):
    img = tf.io.read_file(path)
    # expand_animations=False gives the tensor a static rank, so resize works inside tf.data
    img = tf.image.decode_image(img, channels=3, expand_animations=False)
    img = tf.image.resize(img, target_size)
    img = tf.cast(img, tf.float32) / 255.0

    if augment:
        img = tf.image.random_flip_left_right(img)
        img = tf.image.random_flip_up_down(img)
        img = tf.image.random_brightness(img, max_delta=0.1)
        img = tf.image.random_contrast(img, 0.9, 1.1)

    return tf.clip_by_value(img, 0.0, 1.0)
```

* **Train set:** augmentation enabled
* **Validation/Test sets:** kept clean

---
## **5. Dataset Preparation with `tf.data`**

```python
AUTOTUNE = tf.data.AUTOTUNE
batch_size = 32

train_ds = tf.data.Dataset.from_tensor_slices((train_paths, train_labels))
train_ds = train_ds.shuffle(len(train_paths))
train_ds = train_ds.map(
    lambda x, y: (preprocess_image(x, augment=True), y),
    num_parallel_calls=AUTOTUNE
)
train_ds = train_ds.batch(batch_size).prefetch(AUTOTUNE)
```

Benefits:

* Parallel loading
* Smart prefetching
* Maximized GPU utilization

---
## **6. Model Architecture: InceptionV3**

Transfer learning from ImageNet:

```python
from tensorflow.keras.applications.inception_v3 import InceptionV3
from tensorflow.keras.layers import GlobalAveragePooling2D, Dense, Dropout
from tensorflow.keras.models import Model

input_shape = (512, 512, 3)  # matches the preprocessing target size

inception = InceptionV3(input_shape=input_shape, weights='imagenet', include_top=False)

# freeze the pretrained backbone
for layer in inception.layers:
    layer.trainable = False

x = GlobalAveragePooling2D()(inception.output)
x = Dense(512, activation='relu')(x)
x = Dropout(0.5)(x)
# le is the fitted LabelEncoder from preprocessing
prediction = Dense(len(le.classes_), activation='softmax')(x)

model = Model(inputs=inception.input, outputs=prediction)
```

### Why InceptionV3?

* Factorized convolutions
* Multi-scale feature extraction
* Lightweight and fast
* Strong performance in medical imaging

---
## **7. Training & Callbacks**

```python
from tensorflow.keras.callbacks import EarlyStopping, ModelCheckpoint, ReduceLROnPlateau

model.compile(
    loss='sparse_categorical_crossentropy',
    optimizer='adam',
    metrics=['accuracy']
)

callbacks = [
    EarlyStopping(monitor='val_loss', patience=40, restore_best_weights=True),
    ModelCheckpoint("best_model.h5", save_best_only=True, monitor='val_loss'),
    ReduceLROnPlateau(monitor='val_loss', factor=0.5, patience=10, min_lr=1e-5)
]
```

Training:

```python
history = model.fit(train_ds, validation_data=val_ds, epochs=50, callbacks=callbacks)
```

---
## **8. Training Curves**

```python
import matplotlib.pyplot as plt

plt.plot(history.history['accuracy'], label='Train Accuracy')
plt.plot(history.history['val_accuracy'], label='Val Accuracy')
plt.title('Training vs Validation Accuracy')
plt.legend()
plt.show()
```

* Curves indicate **smooth convergence**
* Small train/val gap → **limited overfitting**

<p align="center">
  <img src="https://files.catbox.moe/le1mbk.png" width="100%">
</p>

---
## **9. Performance Metrics**

### Confusion Matrix

```python
from sklearn.metrics import confusion_matrix, ConfusionMatrixDisplay

cm = confusion_matrix(y_true, y_pred)
ConfusionMatrixDisplay(cm, display_labels=le.classes_).plot(cmap='Blues')
```

<p align="center">
  <img src="https://files.catbox.moe/wuynop.png" width="100%">
</p>

### Multi-class AUC (One-vs-Rest)

**Macro AUC formula:**

<img src="https://latex.codecogs.com/svg.image?\color{white}\text{AUC}_{macro}=\frac{1}{K}\sum_{i=1}^{K}\text{AUC}_i"/>

```python
import numpy as np
from sklearn.preprocessing import label_binarize
from sklearn.metrics import roc_curve, auc

y_true_bin = label_binarize(y_true, classes=np.arange(len(le.classes_)))
```
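From the binarized labels, the per-class AUCs and their macro average follow directly from the formula above. A self-contained sketch, with toy scores standing in for the model's softmax outputs:

```python
import numpy as np
from sklearn.preprocessing import label_binarize
from sklearn.metrics import roc_auc_score

# toy 3-class example; in the notebook, y_score would come from model.predict(...)
y_true = np.array([0, 1, 2, 1, 0, 2])
y_score = np.array([[0.8, 0.1, 0.1], [0.2, 0.7, 0.1], [0.1, 0.2, 0.7],
                    [0.3, 0.6, 0.1], [0.6, 0.2, 0.2], [0.2, 0.2, 0.6]])

y_true_bin = label_binarize(y_true, classes=np.arange(3))
per_class_auc = [roc_auc_score(y_true_bin[:, k], y_score[:, k]) for k in range(3)]
macro_auc = float(np.mean(per_class_auc))  # AUC_macro = (1/K) * sum_k AUC_k
```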
<p align="center">
  <img src="https://files.catbox.moe/w3fazk.png" width="100%">
</p>

---

## **10. Grad-CAM: Interpretability**

Grad-CAM highlights the regions the model uses for classification.

### Grad-CAM heatmap:

<img src="https://latex.codecogs.com/svg.image?\color{white}L^c_{\text{Grad-CAM}}=\text{ReLU}\left(\sum_k\alpha_k^cA^k\right)" />

Where:

<img src="https://latex.codecogs.com/svg.image?\color{white}%5Calpha_k%5Ec%3D%5Cfrac%7B1%7D%7BZ%7D%5Csum_%7Bi%7D%5Csum_%7Bj%7D%5Cfrac%7B%5Cpartial%20y%5Ec%7D%7B%5Cpartial%20A_%7Bij%7D%5Ek%7D" />

Python implementation:

```python
def gradcam(model, img, cls=None):
    # find the last convolutional layer
    lc = next(l for l in reversed(model.layers) if "conv" in l.name.lower())
    gm = tf.keras.Model(model.input, [lc.output, model.output])

    with tf.GradientTape() as t:
        conv, pred = gm(img[None])
        cls = tf.argmax(pred[0]) if cls is None else cls
        loss = pred[:, cls]

    # channel weights: global average of the gradients
    g = t.gradient(loss, conv)
    w = tf.reduce_mean(g, axis=(0, 1, 2))
    cam = tf.reduce_sum(w * conv[0], -1)

    # keep only positive influence and normalize to [0, 1]
    cam = tf.nn.relu(cam)
    cam /= tf.reduce_max(cam) + 1e-8
    return cam.numpy()
```

Visualization example:

```python
plt.figure(figsize=(20, 10))
for i, img in enumerate(sample_images):
    overlay, info = VizGradCAM(model, img)
    plt.subplot(2, 5, i + 1)
    plt.imshow(overlay)
    plt.axis("off")
    plt.title(f"True Label: {le.classes_[sample_labels[i]]}")
plt.show()
```

<p align="center">
  <img src="https://files.catbox.moe/ysg2yc.png" width="100%">
</p>

> **Note:** When the model is highly confident in a prediction, the softmax output saturates and the Grad-CAM gradients become near-zero, producing little to no heatmap activation.
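`VizGradCAM` above is an external helper; the overlay it produces boils down to upsampling the CAM, colorizing it, and alpha-blending it with the input. A minimal NumPy sketch (the `overlay_cam` helper is ours; it assumes the CAM's size divides the image's evenly):

```python
import numpy as np
from matplotlib import cm as mpl_cm

def overlay_cam(img, cam, alpha=0.4):
    """Blend a [0, 1] grayscale CAM onto an RGB image in [0, 1]."""
    sy, sx = img.shape[0] // cam.shape[0], img.shape[1] // cam.shape[1]
    cam_up = np.kron(cam, np.ones((sy, sx)))  # nearest-neighbour upsampling
    heat = mpl_cm.jet(cam_up)[..., :3]        # colormap -> RGBA, keep RGB
    return np.clip((1 - alpha) * img + alpha * heat, 0.0, 1.0)

img = np.zeros((8, 8, 3))                  # placeholder for an MRI slice
cam = np.array([[0.0, 1.0], [0.5, 0.25]])  # placeholder 2x2 CAM from gradcam()
blended = overlay_cam(img, cam)
```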
---

## **11. Technical LaTeX Notes**

### Sparse Categorical Crossentropy

<img src="https://latex.codecogs.com/svg.image?\color{white}L=-\frac{1}{N}\sum_{i=1}^{N}\log(p_{i,y_i})" />
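The loss formula above is easy to sanity-check numerically; a tiny NumPy version (illustrative, not the Keras implementation):

```python
import numpy as np

def sparse_cce(probs, labels):
    """L = -(1/N) * sum_i log(p_{i, y_i})"""
    n = len(labels)
    return float(-np.mean(np.log(probs[np.arange(n), labels])))

probs = np.array([[0.7, 0.2, 0.1],
                  [0.1, 0.8, 0.1]])
loss = sparse_cce(probs, np.array([0, 1]))  # -(log 0.7 + log 0.8) / 2 ≈ 0.290
```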
### Global Average Pooling

<img src="https://latex.codecogs.com/svg.image?\color{white}f_c%3D%5Cfrac%7B1%7D%7Bh%20%5Ccdot%20%5Comega%7D%20%5Csum_%7Bi%3D1%7D%5E%7Bh%7D%20%5Csum_%7Bj%3D1%7D%5E%7B%5Comega%7D%20F_%7Bi%2Cj%2Cc%7D" />

---

## **12. Model Saving**

```python
model.save("InceptionV3_Brain_Tumor_MRI.h5")
```

---
## **13. Results**

> **Note:** Click the image below to view the video showcasing the project’s results.

<a href="https://files.catbox.moe/27ct3j.mp4">
  <img src="https://images.unsplash.com/photo-1611162616475-46b635cb6868?q=80&w=1974&auto=format&fit=crop&ixlib=rb-4.1.0&ixid=M3wxMjA3fDB8MHxwaG90by1wYWdlfHx8fGVufDB8fHx8fA%3D%3D" width="400">
</a>

<hr style="border-bottom: 5px solid gray; margin-top: 10px;">

> **Note:** If the video above is not working, you can access it directly via the link below.

[Watch Demo Video](Results/InceptionV3_Brain_Tumor_MRI.mp4)

---
## **Key Takeaways**

* Strong data cleaning = reliable model
* Heavy augmentation reduces bias
* InceptionV3 provides excellent feature extraction
* Evaluation metrics support an assessment of clinical reliability
* Grad-CAM adds essential interpretability