N-I-M-I committed (verified)
Commit 233caeb · 1 parent: cd7755a

Upload folder using huggingface_hub

Files changed (15):
  1. .gitignore +63 -0
  2. README.md +173 -0
  3. app.py +166 -0
  4. config.py +50 -0
  5. data_loader.py +100 -0
  6. debug_train.py +88 -0
  7. evaluate.py +96 -0
  8. model.py +93 -0
  9. requirements.txt +8 -0
  10. static/script.js +163 -0
  11. static/style.css +391 -0
  12. templates/index.html +103 -0
  13. train.py +220 -0
  14. train_log.txt +0 -0
  15. utils.py +186 -0
.gitignore ADDED
@@ -0,0 +1,63 @@
+ # Python
+ __pycache__/
+ *.py[cod]
+ *$py.class
+ *.so
+ .Python
+ build/
+ develop-eggs/
+ dist/
+ downloads/
+ eggs/
+ .eggs/
+ lib/
+ lib64/
+ parts/
+ sdist/
+ var/
+ wheels/
+ *.egg-info/
+ .installed.cfg
+ *.egg
+
+ # Virtual Environment
+ venv/
+ ENV/
+ env/
+ .venv
+
+ # PyTorch
+ *.pth
+ *.pt
+ checkpoints/
+ !checkpoints/.gitkeep
+
+ # Data
+ data/
+ *.pkl
+ *.pickle
+
+ # Plots and visualizations
+ plots/
+ !plots/.gitkeep
+
+ # IDE
+ .vscode/
+ .idea/
+ *.swp
+ *.swo
+ *~
+
+ # OS
+ .DS_Store
+ Thumbs.db
+
+ # Jupyter Notebook
+ .ipynb_checkpoints
+
+ # Flask
+ instance/
+ .webassets-cache
+
+ # Logs
+ *.log
README.md CHANGED
@@ -1,3 +1,176 @@
  ---
  license: mit
+ datasets:
+ - cifar10
+ metrics:
+ - accuracy
+ library_name: pytorch
+ tags:
+ - image-classification
+ - sequence-classification
  ---
+
+ # CIFAR-10 RNN Image Classifier
+
+ An end-to-end deep learning project for classifying CIFAR-10 images using a recurrent neural network (LSTM) built with PyTorch. Includes a modern web interface for real-time image classification.
+
+ ## 🌟 Features
+
+ - **Custom RNN Architecture**: Bidirectional LSTM layers with dropout
+ - **Complete Training Pipeline**: Automated training with validation, checkpointing, and visualization
+ - **Comprehensive Evaluation**: Confusion matrix, classification reports, and prediction visualizations
+ - **Modern Web Interface**: Flask web app for real-time image classification
+ - **CIFAR-10 Dataset**: Automatically downloads and preprocesses the dataset
+
+ ## 📊 Model Architecture
+
+ The model treats each 32x32 RGB image as a sequence of 32 rows, where each row has 96 features (32 pixels × 3 channels).
+
+ ```
+ Input (Batch, 3, 32, 32)
+
+ Reshape (Batch, 32, 96)
+
+ Bidirectional LSTM (Hidden: 256, Layers: 2, Dropout: 0.2)
+
+ Last Time Step Output
+
+ Fully Connected (512) → ReLU → Dropout(0.3)
+
+ Output (10 classes)
+ ```
+
+ ## 🚀 Quick Start
+
+ ### 1. Install Dependencies
+
+ ```bash
+ pip install -r requirements.txt
+ ```
+
+ ### 2. Train the Model
+
+ ```bash
+ python train.py
+ ```
+
+ This will:
+ - Download the CIFAR-10 dataset automatically
+ - Train the model for 5 epochs (set by `EPOCHS` in `config.py`)
+ - Save checkpoints in `./checkpoints/`
+ - Generate training plots in `./plots/`
+
+ ### 3. Evaluate the Model
+
+ ```bash
+ python evaluate.py
+ ```
+
+ This will:
+ - Load the best model checkpoint
+ - Evaluate on the test set
+ - Generate a confusion matrix
+ - Create prediction visualizations
+
+ ### 4. Run the Web Application
+
+ ```bash
+ python app.py
+ ```
+
+ Then open your browser and navigate to `http://localhost:5000`.
+
+ ## 📁 Project Structure
+
+ ```
+ CNN/
+ ├── config.py          # Configuration and hyperparameters
+ ├── data_loader.py     # Data loading and preprocessing
+ ├── model.py           # RNN model architecture
+ ├── train.py           # Training script
+ ├── evaluate.py        # Evaluation script
+ ├── utils.py           # Utility functions
+ ├── app.py             # Flask web application
+ ├── requirements.txt   # Python dependencies
+ ├── templates/
+ │   └── index.html     # Web interface HTML
+ ├── static/
+ │   ├── style.css      # Web interface CSS
+ │   └── script.js      # Web interface JavaScript
+ ├── checkpoints/       # Model checkpoints (created during training)
+ ├── plots/             # Training visualizations (created during training)
+ └── data/              # CIFAR-10 dataset (downloaded automatically)
+ ```
+
+ ## 🎯 CIFAR-10 Classes
+
+ The model classifies images into 10 categories:
+ 1. Airplane
+ 2. Automobile
+ 3. Bird
+ 4. Cat
+ 5. Deer
+ 6. Dog
+ 7. Frog
+ 8. Horse
+ 9. Ship
+ 10. Truck
+
+ ## ⚙️ Configuration
+
+ Edit `config.py` to customize:
+ - **Training**: epochs, batch size, learning rate
+ - **Model**: number of classes, architecture parameters
+ - **Data**: augmentation settings, normalization values
+ - **Paths**: checkpoint and plot directories
+
+ ## 📈 Training Details
+
+ - **Optimizer**: SGD with momentum (0.9) and weight decay (5e-4)
+ - **Loss Function**: Cross-entropy loss
+ - **Learning Rate**: 0.01 with step decay
+ - **Batch Size**: 32
+ - **Data Augmentation**: Random crop and horizontal flip
+ - **Regularization**: Dropout and weight decay
+
+ ## 🎨 Web Interface Features
+
+ - **Drag & Drop**: Upload images via drag-and-drop
+ - **Random Samples**: Test with random CIFAR-10 images
+ - **Real-time Classification**: Instant predictions with confidence scores
+ - **Top-5 Predictions**: View the probability distribution
+ - **Modern UI**: Dark theme with smooth animations
+
+ ## 📊 Expected Performance
+
+ With the default configuration, the model typically achieves:
+ - **Training Accuracy**: ~90%
+ - **Validation Accuracy**: ~85%
+
+ ## 🛠️ Requirements
+
+ - Python 3.7+
+ - PyTorch 2.0+
+ - torchvision
+ - Flask
+ - NumPy
+ - Matplotlib
+ - scikit-learn
+ - Pillow
+ - tqdm
+
+ ## 📝 License
+
+ This project is open source and available for educational purposes.
+
+ ## 🤝 Contributing
+
+ Feel free to fork this project and submit pull requests for improvements!
+
+ ## 📧 Contact
+
+ For questions or feedback, please open an issue on the repository.
+
+ ---
+
+ **Built with ❤️ using PyTorch and Flask**
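The row-to-sequence reshape described in the README's architecture section can be sketched in NumPy (illustrative only; the committed model does the equivalent with `torch.permute`/`view` on batched tensors):

```python
import numpy as np

# One CHW image; values chosen so every (channel, row, col) cell is unique.
x = np.arange(3 * 32 * 32, dtype=np.float32).reshape(3, 32, 32)

# (3, 32, 32) -> (32, 3, 32) -> (32, 96): each image row becomes one
# 96-feature time step (channels concatenated within the row).
seq = x.transpose(1, 0, 2).reshape(32, 96)

# Feature index c*32 + col of step r holds pixel (c, r, col) of the image.
assert seq.shape == (32, 96)
assert seq[5, 1 * 32 + 7] == x[1, 5, 7]
```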
app.py ADDED
@@ -0,0 +1,166 @@
+ """
+ Flask web application for CIFAR-10 image classification
+ """
+ import os
+ import io
+ import base64
+ import torch
+ from PIL import Image
+ from flask import Flask, render_template, request, jsonify
+ import torchvision.transforms as transforms
+ import numpy as np
+
+ import config
+ from model import get_model
+ from utils import load_checkpoint
+
+ app = Flask(__name__)
+
+ # Global model variable
+ model = None
+
+
+ def load_model():
+     """Load the trained model"""
+     global model
+
+     if not os.path.exists(config.BEST_MODEL_PATH):
+         print(f"Warning: Model checkpoint not found at {config.BEST_MODEL_PATH}")
+         return False
+
+     model = get_model(num_classes=config.NUM_CLASSES, device=config.DEVICE)
+     epoch, accuracy = load_checkpoint(model, None, config.BEST_MODEL_PATH)
+     model.eval()
+
+     print(f"Model loaded from epoch {epoch + 1} with accuracy: {accuracy:.2f}%")
+     return True
+
+
+ def preprocess_image(image):
+     """
+     Preprocess image for model prediction
+
+     Args:
+         image: PIL Image
+
+     Returns:
+         torch.Tensor: Preprocessed image tensor
+     """
+     transform = transforms.Compose([
+         transforms.Resize((32, 32)),
+         transforms.ToTensor(),
+         transforms.Normalize(
+             mean=[0.4914, 0.4822, 0.4465],
+             std=[0.2470, 0.2435, 0.2616]
+         )
+     ])
+
+     return transform(image).unsqueeze(0)
+
+
+ @app.route('/')
+ def index():
+     """Render the main page"""
+     return render_template('index.html', class_names=config.CLASS_NAMES)
+
+
+ @app.route('/predict', methods=['POST'])
+ def predict():
+     """Handle prediction requests"""
+     if model is None:
+         return jsonify({'error': 'Model not loaded'}), 500
+
+     try:
+         # Get image from request
+         if 'file' not in request.files:
+             return jsonify({'error': 'No file provided'}), 400
+
+         file = request.files['file']
+
+         if file.filename == '':
+             return jsonify({'error': 'No file selected'}), 400
+
+         # Read and preprocess image
+         image = Image.open(file.stream).convert('RGB')
+         input_tensor = preprocess_image(image).to(config.DEVICE)
+
+         # Make prediction
+         with torch.no_grad():
+             output = model(input_tensor)
+             probabilities = torch.nn.functional.softmax(output[0], dim=0)
+             confidence, predicted = torch.max(probabilities, 0)
+
+         # Get top 5 predictions
+         top5_prob, top5_idx = torch.topk(probabilities, 5)
+         top5_predictions = [
+             {
+                 'class': config.CLASS_NAMES[idx],
+                 'probability': float(prob * 100)
+             }
+             for idx, prob in zip(top5_idx.cpu().numpy(), top5_prob.cpu().numpy())
+         ]
+
+         # Prepare response
+         response = {
+             'predicted_class': config.CLASS_NAMES[predicted.item()],
+             'confidence': float(confidence.item() * 100),
+             'top5_predictions': top5_predictions
+         }
+
+         return jsonify(response)
+
+     except Exception as e:
+         return jsonify({'error': str(e)}), 500
+
+
+ @app.route('/random_sample', methods=['GET'])
+ def random_sample():
+     """Get a random sample from CIFAR-10 test set or generate dummy if missing"""
+     try:
+         from data_loader import get_data_loaders
+         # Check if dataset exists
+         dataset_path = os.path.join(config.DATA_DIR, 'cifar-10-batches-py')
+
+         if os.path.exists(dataset_path):
+             _, test_loader = get_data_loaders()
+             dataset = test_loader.dataset
+             idx = np.random.randint(0, len(dataset))
+             image, label = dataset[idx]
+
+             # Denormalize image
+             mean = torch.tensor([0.4914, 0.4822, 0.4465]).view(3, 1, 1)
+             std = torch.tensor([0.2470, 0.2435, 0.2616]).view(3, 1, 1)
+             image_denorm = image * std + mean
+             image_denorm = torch.clamp(image_denorm, 0, 1)
+
+             # Convert to PIL Image
+             image_np = (image_denorm.numpy().transpose(1, 2, 0) * 255).astype(np.uint8)
+             label_name = config.CLASS_NAMES[label]
+         else:
+             # Generate dummy image for demonstration
+             image_np = np.random.randint(0, 256, (32, 32, 3), dtype=np.uint8)
+             label_name = "Dummy Sample (Dataset still downloading)"
+
+         pil_image = Image.fromarray(image_np)
+
+         # Convert to base64
+         buffered = io.BytesIO()
+         pil_image.save(buffered, format="PNG")
+         img_str = base64.b64encode(buffered.getvalue()).decode()
+
+         return jsonify({
+             'image': f'data:image/png;base64,{img_str}',
+             'true_label': label_name
+         })
+
+     except Exception as e:
+         return jsonify({'error': str(e)}), 500
+
+
+ if __name__ == '__main__':
+     # Load model
+     if load_model():
+         print("Starting Flask application...")
+         app.run(debug=True, host='0.0.0.0', port=5000)
+     else:
+         print("Failed to load model. Please train the model first using train.py")
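`predict()` above turns raw logits into percentages with softmax and picks the top 5 with `torch.topk`; the same arithmetic in plain NumPy, with made-up toy logits purely for illustration:

```python
import numpy as np

# Hypothetical logits for the 10 CIFAR-10 classes.
logits = np.array([2.0, 1.0, 0.5, -1.0, 0.0, -2.0, 0.2, 1.5, -0.5, 0.1])

# Softmax, with the max subtracted for numerical stability.
exp = np.exp(logits - logits.max())
probs = exp / exp.sum()

# Top-5 class indices, highest probability first (mirrors torch.topk).
top5_idx = np.argsort(probs)[::-1][:5]

assert np.isclose(probs.sum(), 1.0)
assert top5_idx[0] == 0               # index of the largest logit
assert probs[top5_idx[0]] == probs.max()
```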
config.py ADDED
@@ -0,0 +1,50 @@
+ """
+ Configuration file for the CIFAR-10 RNN project
+ """
+ import torch
+
+ # Data configuration
+ DATA_DIR = './data'
+ BATCH_SIZE = 32
+ NUM_WORKERS = 0  # Set to 0 for better stability on some systems without GPU
+
+ # Model configuration
+ NUM_CLASSES = 10
+ INPUT_CHANNELS = 3
+ IMAGE_SIZE = 32
+ HIDDEN_SIZE = 256
+ NUM_LAYERS = 2
+ RNN_DROPOUT = 0.2
+
+ # Training configuration
+ EPOCHS = 5
+ LEARNING_RATE = 0.01  # Increased slightly for faster convergence in few epochs
+ WEIGHT_DECAY = 5e-4
+ MOMENTUM = 0.9
+
+ # Device configuration
+ DEVICE = torch.device('cuda' if torch.cuda.is_available() else 'cpu')
+
+ # Checkpoint configuration
+ CHECKPOINT_DIR = './checkpoints'
+ BEST_MODEL_PATH = './checkpoints/best_model.pth'
+ LAST_MODEL_PATH = './checkpoints/last_model.pth'
+
+ # Visualization configuration
+ PLOTS_DIR = './plots'
+
+ # CIFAR-10 class names
+ CLASS_NAMES = [
+     'airplane', 'automobile', 'bird', 'cat', 'deer',
+     'dog', 'frog', 'horse', 'ship', 'truck'
+ ]
+
+ # Data augmentation settings
+ USE_AUGMENTATION = True
+ RANDOM_CROP_PADDING = 4
+ RANDOM_HORIZONTAL_FLIP = 0.5
+
+ # Learning rate scheduler
+ USE_SCHEDULER = True
+ SCHEDULER_STEP_SIZE = 20
+ SCHEDULER_GAMMA = 0.1
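With `SCHEDULER_STEP_SIZE = 20` and `SCHEDULER_GAMMA = 0.1`, PyTorch's `StepLR` multiplies the learning rate by 0.1 every 20 epochs. A quick sketch of the resulting schedule (note that the 5-epoch default never reaches the first decay):

```python
import math

def stepped_lr(epoch, base_lr=0.01, step_size=20, gamma=0.1):
    """Learning rate that StepLR would yield at a given epoch."""
    return base_lr * gamma ** (epoch // step_size)

assert math.isclose(stepped_lr(0), 0.01)    # epochs 0-19
assert math.isclose(stepped_lr(19), 0.01)
assert math.isclose(stepped_lr(20), 0.001)  # first decay
assert math.isclose(stepped_lr(40), 0.0001)
```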
data_loader.py ADDED
@@ -0,0 +1,100 @@
+ """
+ Data loading and preprocessing for CIFAR-10 dataset
+ """
+ import torch
+ from torch.utils.data import DataLoader
+ from torchvision import datasets, transforms
+ import config
+
+
+ def get_transforms(train=True):
+     """
+     Get data transformations for training or testing
+
+     Args:
+         train (bool): If True, returns training transforms with augmentation
+
+     Returns:
+         torchvision.transforms.Compose: Composed transforms
+     """
+     if train and config.USE_AUGMENTATION:
+         transform = transforms.Compose([
+             transforms.RandomCrop(32, padding=config.RANDOM_CROP_PADDING),
+             transforms.RandomHorizontalFlip(p=config.RANDOM_HORIZONTAL_FLIP),
+             transforms.ToTensor(),
+             transforms.Normalize(
+                 mean=[0.4914, 0.4822, 0.4465],
+                 std=[0.2470, 0.2435, 0.2616]
+             )
+         ])
+     else:
+         transform = transforms.Compose([
+             transforms.ToTensor(),
+             transforms.Normalize(
+                 mean=[0.4914, 0.4822, 0.4465],
+                 std=[0.2470, 0.2435, 0.2616]
+             )
+         ])
+
+     return transform
+
+
+ def get_data_loaders():
+     """
+     Create train and test data loaders for CIFAR-10
+
+     Returns:
+         tuple: (train_loader, test_loader)
+     """
+     # Get transforms
+     train_transform = get_transforms(train=True)
+     test_transform = get_transforms(train=False)
+
+     # Load datasets
+     train_dataset = datasets.CIFAR10(
+         root=config.DATA_DIR,
+         train=True,
+         download=True,
+         transform=train_transform
+     )
+
+     test_dataset = datasets.CIFAR10(
+         root=config.DATA_DIR,
+         train=False,
+         download=True,
+         transform=test_transform
+     )
+
+     # Create data loaders
+     train_loader = DataLoader(
+         train_dataset,
+         batch_size=config.BATCH_SIZE,
+         shuffle=True,
+         num_workers=config.NUM_WORKERS,
+         pin_memory=True if config.DEVICE.type == 'cuda' else False
+     )
+
+     test_loader = DataLoader(
+         test_dataset,
+         batch_size=config.BATCH_SIZE,
+         shuffle=False,
+         num_workers=config.NUM_WORKERS,
+         pin_memory=True if config.DEVICE.type == 'cuda' else False
+     )
+
+     return train_loader, test_loader
+
+
+ def denormalize(tensor):
+     """
+     Denormalize a tensor image for visualization
+
+     Args:
+         tensor: Normalized tensor image
+
+     Returns:
+         tensor: Denormalized tensor image
+     """
+     mean = torch.tensor([0.4914, 0.4822, 0.4465]).view(3, 1, 1)
+     std = torch.tensor([0.2470, 0.2435, 0.2616]).view(3, 1, 1)
+     return tensor * std + mean
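`denormalize()` inverts the per-channel `transforms.Normalize` used above; the round trip can be checked with a NumPy stand-in for the tensor version (same means/stds, fake image data):

```python
import numpy as np

mean = np.array([0.4914, 0.4822, 0.4465]).reshape(3, 1, 1)
std = np.array([0.2470, 0.2435, 0.2616]).reshape(3, 1, 1)

rng = np.random.default_rng(0)
img = rng.random((3, 32, 32))          # a fake CHW image in [0, 1]

normalized = (img - mean) / std        # what transforms.Normalize does
recovered = normalized * std + mean    # what denormalize() does

assert np.allclose(recovered, img)
```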
debug_train.py ADDED
@@ -0,0 +1,88 @@
+ """
+ Debug training script using dummy data to test the pipeline without downloading CIFAR-10
+ """
+ import os
+ import torch
+ import torch.nn as nn
+ import torch.optim as optim
+ from torch.utils.data import DataLoader, TensorDataset
+ from tqdm import tqdm
+
+ import config
+ from model import get_model, count_parameters
+ from utils import save_checkpoint, plot_training_history
+
+
+ def get_dummy_data_loaders():
+     """Create dummy data loaders for testing"""
+     # Create random images (32x32) and labels (0-9)
+     train_size = 100
+     test_size = 20
+
+     train_images = torch.randn(train_size, 3, 32, 32)
+     train_labels = torch.randint(0, 10, (train_size,))
+
+     test_images = torch.randn(test_size, 3, 32, 32)
+     test_labels = torch.randint(0, 10, (test_size,))
+
+     train_dataset = TensorDataset(train_images, train_labels)
+     test_dataset = TensorDataset(test_images, test_labels)
+
+     train_loader = DataLoader(train_dataset, batch_size=config.BATCH_SIZE, shuffle=True)
+     test_loader = DataLoader(test_dataset, batch_size=config.BATCH_SIZE, shuffle=False)
+
+     return train_loader, test_loader
+
+
+ def debug_train():
+     """Debug training function"""
+     os.makedirs(config.CHECKPOINT_DIR, exist_ok=True)
+     os.makedirs(config.PLOTS_DIR, exist_ok=True)
+
+     print("Creating dummy data loaders...")
+     train_loader, test_loader = get_dummy_data_loaders()
+
+     print(f"Creating model on {config.DEVICE}...")
+     model = get_model(num_classes=config.NUM_CLASSES, device=config.DEVICE)
+
+     criterion = nn.CrossEntropyLoss()
+     optimizer = optim.SGD(model.parameters(), lr=0.01)
+
+     history = {'train_loss': [], 'train_acc': [], 'val_loss': [], 'val_acc': []}
+
+     print("Starting debug training for 2 epochs...")
+     for epoch in range(2):
+         model.train()
+         running_loss = 0.0
+         correct = 0
+         total = 0
+
+         for inputs, labels in tqdm(train_loader, desc=f"Epoch {epoch+1}"):
+             inputs, labels = inputs.to(config.DEVICE), labels.to(config.DEVICE)
+             optimizer.zero_grad()
+             outputs = model(inputs)
+             loss = criterion(outputs, labels)
+             loss.backward()
+             optimizer.step()
+
+             running_loss += loss.item()
+             _, predicted = outputs.max(1)
+             total += labels.size(0)
+             correct += predicted.eq(labels).sum().item()
+
+         train_loss = running_loss / len(train_loader)
+         train_acc = 100. * correct / total
+
+         history['train_loss'].append(train_loss)
+         history['train_acc'].append(train_acc)
+         history['val_loss'].append(train_loss)  # Just use train loss for dummy validation
+         history['val_acc'].append(train_acc)
+
+         print(f"Epoch {epoch+1}: Loss {train_loss:.4f}, Acc {train_acc:.2f}%")
+
+     # Save "best" model for app testing
+     save_checkpoint(model, optimizer, epoch, train_acc, config.BEST_MODEL_PATH)
+     plot_training_history(history, config.PLOTS_DIR)
+
+     print("\nDebug training complete. 'best_model.pth' created for testing the web app.")
+
+
+ if __name__ == "__main__":
+     debug_train()
evaluate.py ADDED
@@ -0,0 +1,96 @@
+ """
+ Evaluation script for the CIFAR-10 RNN
+ """
+ import os
+ import torch
+ from tqdm import tqdm
+
+ import config
+ from model import get_model
+ from data_loader import get_data_loaders
+ from utils import (
+     load_checkpoint, plot_confusion_matrix,
+     print_classification_report, visualize_predictions
+ )
+
+
+ def evaluate():
+     """
+     Evaluate the trained model
+     """
+     # Create plots directory
+     os.makedirs(config.PLOTS_DIR, exist_ok=True)
+
+     # Get data loaders
+     print("Loading CIFAR-10 dataset...")
+     _, test_loader = get_data_loaders()
+     print(f"Test samples: {len(test_loader.dataset)}")
+
+     # Create model
+     print(f"\nLoading model from {config.BEST_MODEL_PATH}")
+     model = get_model(num_classes=config.NUM_CLASSES, device=config.DEVICE)
+
+     # Load checkpoint
+     if not os.path.exists(config.BEST_MODEL_PATH):
+         print(f"Error: Model checkpoint not found at {config.BEST_MODEL_PATH}")
+         print("Please train the model first using train.py")
+         return
+
+     epoch, accuracy = load_checkpoint(model, None, config.BEST_MODEL_PATH)
+     print(f"Loaded model from epoch {epoch + 1} with accuracy: {accuracy:.2f}%")
+
+     # Evaluate
+     model.eval()
+     correct = 0
+     total = 0
+     all_predictions = []
+     all_labels = []
+
+     print("\nEvaluating model...")
+     with torch.no_grad():
+         pbar = tqdm(test_loader, desc='Evaluating')
+         for inputs, labels in pbar:
+             inputs, labels = inputs.to(config.DEVICE), labels.to(config.DEVICE)
+
+             # Forward pass
+             outputs = model(inputs)
+             _, predicted = outputs.max(1)
+
+             # Statistics
+             total += labels.size(0)
+             correct += predicted.eq(labels).sum().item()
+
+             # Store predictions and labels
+             all_predictions.extend(predicted.cpu().numpy())
+             all_labels.extend(labels.cpu().numpy())
+
+             # Update progress bar
+             pbar.set_postfix({'acc': f'{100. * correct / total:.2f}%'})
+
+     # Calculate final accuracy
+     final_accuracy = 100. * correct / total
+
+     # Print results
+     print("\n" + "=" * 80)
+     print(f"Final Test Accuracy: {final_accuracy:.2f}%")
+     print(f"Correct predictions: {correct}/{total}")
+     print("=" * 80)
+
+     # Print classification report
+     print_classification_report(all_labels, all_predictions)
+
+     # Plot confusion matrix
+     print("\nGenerating confusion matrix...")
+     cm_path = os.path.join(config.PLOTS_DIR, 'confusion_matrix.png')
+     plot_confusion_matrix(all_labels, all_predictions, cm_path)
+     print(f"Confusion matrix saved to {cm_path}")
+
+     # Visualize predictions
+     print("\nGenerating prediction visualizations...")
+     visualize_predictions(model, test_loader, config.DEVICE, num_images=16)
+
+     print("\nEvaluation completed!")
+
+
+ if __name__ == '__main__':
+     evaluate()
model.py ADDED
@@ -0,0 +1,93 @@
+ """
+ RNN Model Architecture for CIFAR-10 Classification
+ """
+ import torch
+ import torch.nn as nn
+ import config
+
+
+ class CIFAR10RNN(nn.Module):
+     """
+     Recurrent Neural Network (LSTM) for CIFAR-10 classification
+
+     Architecture:
+     - Input sequence: 32 rows of 32x3 pixels (= 96 features per step)
+     - Bidirectional LSTM layers
+     - Fully connected layer for classification
+     """
+
+     def __init__(self, input_size=96, hidden_size=256, num_layers=2, num_classes=10):
+         super(CIFAR10RNN, self).__init__()
+
+         self.hidden_size = hidden_size
+         self.num_layers = num_layers
+
+         # LSTM layer
+         # batch_first=True means input shape is (batch, seq, feature)
+         self.lstm = nn.LSTM(
+             input_size,
+             hidden_size,
+             num_layers,
+             batch_first=True,
+             bidirectional=True,
+             dropout=config.RNN_DROPOUT if num_layers > 1 else 0
+         )
+
+         # Fully connected layer
+         # * 2 because of bidirectional
+         self.fc = nn.Sequential(
+             nn.Linear(hidden_size * 2, 512),
+             nn.ReLU(),
+             nn.Dropout(0.3),
+             nn.Linear(512, num_classes)
+         )
+
+     def forward(self, x):
+         # x shape: (batch, 3, 32, 32)
+         # Convert to: (batch, seq_len=32, input_size=96)
+         batch_size = x.size(0)
+
+         # Rearrange image rows into a sequence
+         # (batch, 3, 32, 32) -> (batch, 32, 3, 32) -> (batch, 32, 96)
+         x = x.permute(0, 2, 1, 3).contiguous()
+         x = x.view(batch_size, 32, -1)
+
+         # LSTM forward pass
+         # out: tensor of shape (batch, seq_len, hidden_size * 2)
+         out, _ = self.lstm(x)
+
+         # Take the output of the last time step
+         out = out[:, -1, :]
+
+         # Classification
+         out = self.fc(out)
+
+         return out
+
+
+ def get_model(num_classes=10, device='cpu'):
+     """
+     Create and return the RNN model
+
+     Args:
+         num_classes (int): Number of output classes
+         device (str or torch.device): Device to load the model on
+
+     Returns:
+         CIFAR10RNN: The RNN model
+     """
+     model = CIFAR10RNN(
+         input_size=32*3,
+         hidden_size=config.HIDDEN_SIZE,
+         num_layers=config.NUM_LAYERS,
+         num_classes=num_classes
+     )
+     model = model.to(device)
+     return model
+
+
+ def count_parameters(model):
+     """
+     Count the number of trainable parameters in the model
+     """
+     return sum(p.numel() for p in model.parameters() if p.requires_grad)
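`count_parameters()` should agree with a hand count: each `nn.LSTM` layer/direction holds `weight_ih` (4H×I), `weight_hh` (4H×H), and two 4H biases, and layers after the first see 2H inputs because the LSTM is bidirectional. A sketch for the configured sizes (H=256, 2 layers, 96 input features; the FC head is 512→512→10):

```python
def lstm_params(input_size, hidden, layers, bidirectional=True):
    """Parameter count of nn.LSTM: per layer and direction,
    weight_ih (4H x I) + weight_hh (4H x H) + biases b_ih, b_hh (4H each)."""
    dirs = 2 if bidirectional else 1
    total = 0
    for layer in range(layers):
        i = input_size if layer == 0 else hidden * dirs
        total += dirs * 4 * (hidden * i + hidden * hidden + 2 * hidden)
    return total

H = 256
lstm = lstm_params(96, H, 2)
# Linear layers of the FC head, including biases: (2H -> 512) and (512 -> 10).
fc = (2 * H * 512 + 512) + (512 * 10 + 10)

assert lstm == 2_301_952
assert lstm + fc == 2_569_738   # total trainable parameters
```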
requirements.txt ADDED
@@ -0,0 +1,8 @@
+ torch>=2.0.0
+ torchvision>=0.15.0
+ numpy>=1.24.0
+ matplotlib>=3.7.0
+ pillow>=9.5.0
+ flask>=2.3.0
+ tqdm>=4.65.0
+ scikit-learn>=1.3.0
static/script.js ADDED
@@ -0,0 +1,163 @@
+ // CIFAR-10 Classifier JavaScript
+
+ let selectedFile = null;
+
+ // DOM Elements
+ const uploadArea = document.getElementById('uploadArea');
+ const fileInput = document.getElementById('fileInput');
+ const previewCard = document.getElementById('previewCard');
+ const previewImage = document.getElementById('previewImage');
+ const classifyBtn = document.getElementById('classifyBtn');
+ const randomBtn = document.getElementById('randomBtn');
+ const resultsSection = document.getElementById('resultsSection');
+ const loadingOverlay = document.getElementById('loadingOverlay');
+
+ // Upload area click handler
+ uploadArea.addEventListener('click', () => {
+     fileInput.click();
+ });
+
+ // File input change handler
+ fileInput.addEventListener('change', (e) => {
+     const file = e.target.files[0];
+     if (file) {
+         handleFile(file);
+     }
+ });
+
+ // Drag and drop handlers
+ uploadArea.addEventListener('dragover', (e) => {
+     e.preventDefault();
+     uploadArea.classList.add('drag-over');
+ });
+
+ uploadArea.addEventListener('dragleave', () => {
+     uploadArea.classList.remove('drag-over');
+ });
+
+ uploadArea.addEventListener('drop', (e) => {
+     e.preventDefault();
+     uploadArea.classList.remove('drag-over');
+
+     const file = e.dataTransfer.files[0];
+     if (file && file.type.startsWith('image/')) {
+         handleFile(file);
+     }
+ });
+
+ // Handle file selection
+ function handleFile(file) {
+     selectedFile = file;
+
+     const reader = new FileReader();
+     reader.onload = (e) => {
+         previewImage.src = e.target.result;
+         previewCard.style.display = 'block';
+         resultsSection.style.display = 'none';
+     };
+     reader.readAsDataURL(file);
+ }
+
+ // Classify button handler
+ classifyBtn.addEventListener('click', async () => {
+     if (!selectedFile) return;
+
+     const formData = new FormData();
+     formData.append('file', selectedFile);
+
+     try {
+         loadingOverlay.style.display = 'flex';
+
+         const response = await fetch('/predict', {
+             method: 'POST',
+             body: formData
+         });
+
+         const data = await response.json();
+
+         if (data.error) {
+             alert('Error: ' + data.error);
+             return;
+         }
+
+         displayResults(data);
+
+     } catch (error) {
+         alert('Error: ' + error.message);
+     } finally {
+         loadingOverlay.style.display = 'none';
+     }
+ });
+
+ // Random sample button handler
+ randomBtn.addEventListener('click', async () => {
+     try {
+         loadingOverlay.style.display = 'flex';
+
+         const response = await fetch('/random_sample');
+         const data = await response.json();
+
+         if (data.error) {
+             alert('Error: ' + data.error);
+             return;
+         }
+
+         // Convert base64 to blob
+         const blob = await fetch(data.image).then(r => r.blob());
+         const file = new File([blob], 'random_sample.png', { type: 'image/png' });
+
+         handleFile(file);
+
+     } catch (error) {
+         alert('Error: ' + error.message);
+     } finally {
+         loadingOverlay.style.display = 'none';
+     }
+ });
+
+ // Display classification results
+ function displayResults(data) {
+     document.getElementById('predictedClass').textContent = data.predicted_class;
+     document.getElementById('confidenceValue').textContent = data.confidence.toFixed(2) + '%';
+
+     // Update confidence badge color based on confidence level
+     const badge = document.getElementById('confidenceBadge');
+     if (data.confidence >= 80) {
+         badge.style.background = 'rgba(79, 172, 254, 0.2)';
+         badge.style.borderColor = 'rgba(79, 172, 254, 0.4)';
+         badge.style.color = '#4facfe';
+     } else if (data.confidence >= 60) {
+         badge.style.background = 'rgba(240, 147, 251, 0.2)';
+         badge.style.borderColor = 'rgba(240, 147, 251, 0.4)';
+         badge.style.color = '#f093fb';
+     } else {
+         badge.style.background = 'rgba(245, 87, 108, 0.2)';
+         badge.style.borderColor = 'rgba(245, 87, 108, 0.4)';
+         badge.style.color = '#f5576c';
+     }
+
+     // Display top 5 predictions
+     const top5Container = document.getElementById('top5Container');
+     top5Container.innerHTML = '';
+
+     data.top5_predictions.forEach((pred, index) => {
+         const item = document.createElement('div');
+         item.className = 'prediction-item';
+         item.style.animationDelay = `${index * 0.1}s`;
+
+         item.innerHTML = `
+             <span class="prediction-item-name">${pred.class}</span>
+             <div class="prediction-item-bar">
+                 <div class="prediction-item-fill" style="width: ${pred.probability}%"></div>
+             </div>
+             <span class="prediction-item-value">${pred.probability.toFixed(2)}%</span>
+         `;
+
+         top5Container.appendChild(item);
+     });
+
+     resultsSection.style.display = 'grid';
+
+     // Scroll to results
+     resultsSection.scrollIntoView({ behavior: 'smooth', block: 'nearest' });
+ }
static/style.css ADDED
@@ -0,0 +1,391 @@
+/* Modern CSS for CIFAR-10 Classifier */
+
+:root {
+    --primary-gradient: linear-gradient(135deg, #667eea 0%, #764ba2 100%);
+    --secondary-gradient: linear-gradient(135deg, #f093fb 0%, #f5576c 100%);
+    --success-gradient: linear-gradient(135deg, #4facfe 0%, #00f2fe 100%);
+    --bg-primary: #0a0e27;
+    --bg-secondary: #151932;
+    --bg-card: rgba(255, 255, 255, 0.05);
+    --bg-card-hover: rgba(255, 255, 255, 0.08);
+    --text-primary: #ffffff;
+    --text-secondary: #a0aec0;
+    --text-muted: #718096;
+    --border-color: rgba(255, 255, 255, 0.1);
+    --shadow-lg: 0 8px 32px rgba(0, 0, 0, 0.3);
+    --spacing-xs: 0.5rem;
+    --spacing-sm: 1rem;
+    --spacing-md: 1.5rem;
+    --spacing-lg: 2rem;
+    --spacing-xl: 3rem;
+    --radius-sm: 8px;
+    --radius-md: 12px;
+    --radius-lg: 16px;
+    --radius-xl: 24px;
+}
+
+* {
+    margin: 0;
+    padding: 0;
+    box-sizing: border-box;
+}
+
+body {
+    font-family: 'Inter', -apple-system, BlinkMacSystemFont, 'Segoe UI', sans-serif;
+    background: var(--bg-primary);
+    color: var(--text-primary);
+    line-height: 1.6;
+    min-height: 100vh;
+}
+
+body::before {
+    content: '';
+    position: fixed;
+    top: 0;
+    left: 0;
+    width: 100%;
+    height: 100%;
+    background: radial-gradient(circle at 20% 50%, rgba(102, 126, 234, 0.1) 0%, transparent 50%),
+                radial-gradient(circle at 80% 80%, rgba(118, 75, 162, 0.1) 0%, transparent 50%);
+    z-index: -1;
+}
+
+.container {
+    max-width: 1400px;
+    margin: 0 auto;
+    padding: var(--spacing-lg);
+}
+
+.header {
+    text-align: center;
+    margin-bottom: var(--spacing-xl);
+    padding: var(--spacing-xl) 0;
+}
+
+.title {
+    font-size: 3.5rem;
+    font-weight: 700;
+    margin-bottom: var(--spacing-sm);
+}
+
+.gradient-text {
+    background: var(--primary-gradient);
+    -webkit-background-clip: text;
+    -webkit-text-fill-color: transparent;
+    background-clip: text;
+}
+
+.subtitle {
+    font-size: 1.25rem;
+    color: var(--text-secondary);
+}
+
+.upload-section {
+    display: grid;
+    grid-template-columns: 1fr 1fr;
+    gap: var(--spacing-lg);
+    margin-bottom: var(--spacing-xl);
+}
+
+.card {
+    background: var(--bg-card);
+    backdrop-filter: blur(10px);
+    border: 1px solid var(--border-color);
+    border-radius: var(--radius-lg);
+    padding: var(--spacing-lg);
+    transition: all 0.3s ease;
+}
+
+.card:hover {
+    background: var(--bg-card-hover);
+    transform: translateY(-2px);
+    box-shadow: var(--shadow-lg);
+}
+
+.card-title {
+    font-size: 1.5rem;
+    font-weight: 600;
+    margin-bottom: var(--spacing-md);
+}
+
+.upload-area {
+    border: 2px dashed var(--border-color);
+    border-radius: var(--radius-md);
+    padding: var(--spacing-xl);
+    text-align: center;
+    cursor: pointer;
+    transition: all 0.3s ease;
+    margin-bottom: var(--spacing-md);
+}
+
+.upload-area:hover {
+    border-color: #667eea;
+    background: rgba(102, 126, 234, 0.05);
+}
+
+.upload-icon {
+    width: 64px;
+    height: 64px;
+    margin: 0 auto var(--spacing-md);
+    color: #667eea;
+}
+
+.upload-text {
+    font-size: 1.125rem;
+    font-weight: 500;
+    margin-bottom: var(--spacing-xs);
+}
+
+.upload-subtext {
+    font-size: 0.875rem;
+    color: var(--text-muted);
+}
+
+.image-preview {
+    width: 100%;
+    height: 300px;
+    border-radius: var(--radius-md);
+    overflow: hidden;
+    margin-bottom: var(--spacing-md);
+    background: var(--bg-secondary);
+    display: flex;
+    align-items: center;
+    justify-content: center;
+}
+
+.image-preview img {
+    max-width: 100%;
+    max-height: 100%;
+    object-fit: contain;
+}
+
+.btn {
+    display: inline-flex;
+    align-items: center;
+    justify-content: center;
+    gap: var(--spacing-xs);
+    padding: 0.875rem 1.75rem;
+    font-size: 1rem;
+    font-weight: 600;
+    border: none;
+    border-radius: var(--radius-md);
+    cursor: pointer;
+    transition: all 0.3s ease;
+    width: 100%;
+}
+
+.btn svg {
+    width: 20px;
+    height: 20px;
+}
+
+.btn-primary {
+    background: var(--primary-gradient);
+    color: white;
+}
+
+.btn-primary:hover {
+    transform: translateY(-2px);
+    box-shadow: 0 8px 24px rgba(102, 126, 234, 0.4);
+}
+
+.btn-secondary {
+    background: var(--bg-secondary);
+    color: var(--text-primary);
+    border: 1px solid var(--border-color);
+}
+
+.btn-secondary:hover {
+    background: var(--bg-card);
+    border-color: #667eea;
+}
+
+.results-section {
+    display: grid;
+    grid-template-columns: 2fr 1fr;
+    gap: var(--spacing-lg);
+}
+
+.prediction-main {
+    text-align: center;
+    padding: var(--spacing-lg);
+    background: var(--bg-secondary);
+    border-radius: var(--radius-md);
+    margin-bottom: var(--spacing-lg);
+}
+
+.prediction-label {
+    font-size: 0.875rem;
+    text-transform: uppercase;
+    letter-spacing: 0.1em;
+    color: var(--text-muted);
+    margin-bottom: var(--spacing-sm);
+}
+
+.prediction-class {
+    font-size: 2.5rem;
+    font-weight: 700;
+    background: var(--success-gradient);
+    -webkit-background-clip: text;
+    -webkit-text-fill-color: transparent;
+    background-clip: text;
+    margin-bottom: var(--spacing-md);
+}
+
+.confidence-badge {
+    display: inline-block;
+    padding: var(--spacing-xs) var(--spacing-md);
+    background: rgba(79, 172, 254, 0.2);
+    border: 1px solid rgba(79, 172, 254, 0.4);
+    border-radius: var(--radius-xl);
+    font-size: 0.875rem;
+    font-weight: 600;
+    color: #4facfe;
+}
+
+.predictions-title {
+    font-size: 1.125rem;
+    font-weight: 600;
+    margin-bottom: var(--spacing-md);
+    color: var(--text-secondary);
+}
+
+.prediction-item {
+    display: flex;
+    align-items: center;
+    justify-content: space-between;
+    padding: var(--spacing-sm);
+    background: var(--bg-secondary);
+    border-radius: var(--radius-sm);
+    margin-bottom: var(--spacing-sm);
+    transition: all 0.3s ease;
+}
+
+.prediction-item:hover {
+    background: var(--bg-card);
+    transform: translateX(4px);
+}
+
+.prediction-item-name {
+    font-weight: 500;
+}
+
+.prediction-item-bar {
+    flex: 1;
+    height: 8px;
+    background: var(--bg-primary);
+    border-radius: var(--radius-xl);
+    margin: 0 var(--spacing-md);
+    overflow: hidden;
+}
+
+.prediction-item-fill {
+    height: 100%;
+    background: var(--primary-gradient);
+    border-radius: var(--radius-xl);
+    transition: width 0.8s ease;
+}
+
+.prediction-item-value {
+    font-weight: 600;
+    color: var(--text-secondary);
+    min-width: 50px;
+    text-align: right;
+}
+
+.info-title {
+    font-size: 1.125rem;
+    font-weight: 600;
+    margin-bottom: var(--spacing-md);
+    color: var(--text-secondary);
+}
+
+.classes-grid {
+    display: grid;
+    grid-template-columns: 1fr 1fr;
+    gap: var(--spacing-sm);
+}
+
+.class-item {
+    display: flex;
+    align-items: center;
+    gap: var(--spacing-sm);
+    padding: var(--spacing-sm);
+    background: var(--bg-secondary);
+    border-radius: var(--radius-sm);
+    transition: all 0.3s ease;
+}
+
+.class-item:hover {
+    background: var(--bg-card);
+    transform: translateX(4px);
+}
+
+.class-icon {
+    width: 32px;
+    height: 32px;
+    display: flex;
+    align-items: center;
+    justify-content: center;
+    background: var(--primary-gradient);
+    border-radius: var(--radius-sm);
+    font-weight: 600;
+    font-size: 0.875rem;
+}
+
+.class-name {
+    font-weight: 500;
+    text-transform: capitalize;
+}
+
+.loading-overlay {
+    position: fixed;
+    top: 0;
+    left: 0;
+    width: 100%;
+    height: 100%;
+    background: rgba(10, 14, 39, 0.9);
+    backdrop-filter: blur(8px);
+    display: flex;
+    flex-direction: column;
+    align-items: center;
+    justify-content: center;
+    z-index: 1000;
+}
+
+.spinner {
+    width: 64px;
+    height: 64px;
+    border: 4px solid var(--border-color);
+    border-top-color: #667eea;
+    border-radius: 50%;
+    animation: spin 1s linear infinite;
+}
+
+@keyframes spin {
+    to { transform: rotate(360deg); }
+}
+
+.loading-text {
+    margin-top: var(--spacing-md);
+    font-size: 1.125rem;
+    color: var(--text-secondary);
+}
+
+.footer {
+    text-align: center;
+    padding: var(--spacing-xl) 0;
+    color: var(--text-muted);
+    font-size: 0.875rem;
+    border-top: 1px solid var(--border-color);
+    margin-top: var(--spacing-xl);
+}
+
+@media (max-width: 1024px) {
+    .upload-section, .results-section {
+        grid-template-columns: 1fr;
+    }
+    .title {
+        font-size: 2.5rem;
+    }
+}
templates/index.html ADDED
@@ -0,0 +1,103 @@
+<!DOCTYPE html>
+<html lang="en">
+<head>
+    <meta charset="UTF-8">
+    <meta name="viewport" content="width=device-width, initial-scale=1.0">
+    <title>CIFAR-10 Image Classifier</title>
+    <meta name="description" content="Deep learning powered CIFAR-10 image classification using Convolutional Neural Networks">
+    <link rel="stylesheet" href="{{ url_for('static', filename='style.css') }}">
+    <link rel="preconnect" href="https://fonts.googleapis.com">
+    <link rel="preconnect" href="https://fonts.gstatic.com" crossorigin>
+    <link href="https://fonts.googleapis.com/css2?family=Inter:wght@300;400;500;600;700&display=swap" rel="stylesheet">
+</head>
+<body>
+    <div class="container">
+        <header class="header">
+            <div class="header-content">
+                <h1 class="title">
+                    <span class="gradient-text">CIFAR-10</span> Image Classifier
+                </h1>
+                <p class="subtitle">Powered by Deep Learning & Convolutional Neural Networks</p>
+            </div>
+        </header>
+
+        <main class="main-content">
+            <div class="upload-section">
+                <div class="card upload-card">
+                    <h2 class="card-title">Upload Image</h2>
+                    <div class="upload-area" id="uploadArea">
+                        <svg class="upload-icon" xmlns="http://www.w3.org/2000/svg" fill="none" viewBox="0 0 24 24" stroke="currentColor">
+                            <path stroke-linecap="round" stroke-linejoin="round" stroke-width="2" d="M7 16a4 4 0 01-.88-7.903A5 5 0 1115.9 6L16 6a5 5 0 011 9.9M15 13l-3-3m0 0l-3 3m3-3v12" />
+                        </svg>
+                        <p class="upload-text">Drag & drop an image here</p>
+                        <p class="upload-subtext">or click to browse</p>
+                        <input type="file" id="fileInput" accept="image/*" hidden>
+                    </div>
+
+                    <button class="btn btn-secondary" id="randomBtn">
+                        <svg xmlns="http://www.w3.org/2000/svg" fill="none" viewBox="0 0 24 24" stroke="currentColor">
+                            <path stroke-linecap="round" stroke-linejoin="round" stroke-width="2" d="M4 4v5h.582m15.356 2A8.001 8.001 0 004.582 9m0 0H9m11 11v-5h-.581m0 0a8.003 8.003 0 01-15.357-2m15.357 2H15" />
+                        </svg>
+                        Try Random Sample
+                    </button>
+                </div>
+
+                <div class="card preview-card" id="previewCard" style="display: none;">
+                    <h2 class="card-title">Preview</h2>
+                    <div class="image-preview">
+                        <img id="previewImage" src="" alt="Preview">
+                    </div>
+                    <button class="btn btn-primary" id="classifyBtn">
+                        <svg xmlns="http://www.w3.org/2000/svg" fill="none" viewBox="0 0 24 24" stroke="currentColor">
+                            <path stroke-linecap="round" stroke-linejoin="round" stroke-width="2" d="M9 12l2 2 4-4m6 2a9 9 0 11-18 0 9 9 0 0118 0z" />
+                        </svg>
+                        Classify Image
+                    </button>
+                </div>
+            </div>
+
+            <div class="results-section" id="resultsSection" style="display: none;">
+                <div class="card results-card">
+                    <h2 class="card-title">Classification Results</h2>
+
+                    <div class="prediction-main">
+                        <div class="prediction-label">Predicted Class</div>
+                        <div class="prediction-class" id="predictedClass">-</div>
+                        <div class="confidence-badge" id="confidenceBadge">
+                            <span id="confidenceValue">0%</span> confidence
+                        </div>
+                    </div>
+
+                    <div class="top-predictions">
+                        <h3 class="predictions-title">Top 5 Predictions</h3>
+                        <div id="top5Container"></div>
+                    </div>
+                </div>
+
+                <div class="card info-card">
+                    <h3 class="info-title">CIFAR-10 Classes</h3>
+                    <div class="classes-grid">
+                        {% for class_name in class_names %}
+                        <div class="class-item">
+                            <div class="class-icon">{{ loop.index0 }}</div>
+                            <div class="class-name">{{ class_name }}</div>
+                        </div>
+                        {% endfor %}
+                    </div>
+                </div>
+            </div>
+
+            <div class="loading-overlay" id="loadingOverlay" style="display: none;">
+                <div class="spinner"></div>
+                <p class="loading-text">Classifying image...</p>
+            </div>
+        </main>
+
+        <footer class="footer">
+            <p>Built with PyTorch & Flask | CNN Architecture with 3 Convolutional Blocks</p>
+        </footer>
+    </div>
+
+    <script src="{{ url_for('static', filename='script.js') }}"></script>
+</body>
+</html>
train.py ADDED
@@ -0,0 +1,220 @@
+"""
+Training script for CIFAR-10 CNN
+"""
+import os
+import torch
+import torch.nn as nn
+import torch.optim as optim
+from tqdm import tqdm
+import matplotlib.pyplot as plt
+
+import config
+from model import get_model, count_parameters
+from data_loader import get_data_loaders
+from utils import save_checkpoint, load_checkpoint, plot_training_history
+
+
+def train_epoch(model, train_loader, criterion, optimizer, device):
+    """
+    Train the model for one epoch
+
+    Args:
+        model: PyTorch model
+        train_loader: Training data loader
+        criterion: Loss function
+        optimizer: Optimizer
+        device: Device to train on
+
+    Returns:
+        tuple: (average_loss, accuracy)
+    """
+    model.train()
+    running_loss = 0.0
+    correct = 0
+    total = 0
+
+    pbar = tqdm(train_loader, desc='Training')
+    for inputs, labels in pbar:
+        inputs, labels = inputs.to(device), labels.to(device)
+
+        # Zero the parameter gradients
+        optimizer.zero_grad()
+
+        # Forward pass
+        outputs = model(inputs)
+        loss = criterion(outputs, labels)
+
+        # Backward pass and optimize
+        loss.backward()
+        optimizer.step()
+
+        # Statistics
+        running_loss += loss.item()
+        _, predicted = outputs.max(1)
+        total += labels.size(0)
+        correct += predicted.eq(labels).sum().item()
+
+        # Update progress bar
+        pbar.set_postfix({
+            'loss': f'{running_loss / (pbar.n + 1):.4f}',
+            'acc': f'{100. * correct / total:.2f}%'
+        })
+
+    epoch_loss = running_loss / len(train_loader)
+    epoch_acc = 100. * correct / total
+
+    return epoch_loss, epoch_acc
+
+
+def validate(model, test_loader, criterion, device):
+    """
+    Validate the model
+
+    Args:
+        model: PyTorch model
+        test_loader: Test data loader
+        criterion: Loss function
+        device: Device to validate on
+
+    Returns:
+        tuple: (average_loss, accuracy)
+    """
+    model.eval()
+    running_loss = 0.0
+    correct = 0
+    total = 0
+
+    with torch.no_grad():
+        pbar = tqdm(test_loader, desc='Validation')
+        for inputs, labels in pbar:
+            inputs, labels = inputs.to(device), labels.to(device)
+
+            # Forward pass
+            outputs = model(inputs)
+            loss = criterion(outputs, labels)
+
+            # Statistics
+            running_loss += loss.item()
+            _, predicted = outputs.max(1)
+            total += labels.size(0)
+            correct += predicted.eq(labels).sum().item()
+
+            # Update progress bar
+            pbar.set_postfix({
+                'loss': f'{running_loss / (pbar.n + 1):.4f}',
+                'acc': f'{100. * correct / total:.2f}%'
+            })
+
+    epoch_loss = running_loss / len(test_loader)
+    epoch_acc = 100. * correct / total
+
+    return epoch_loss, epoch_acc
+
+
+def train():
+    """
+    Main training function
+    """
+    # Create directories
+    os.makedirs(config.CHECKPOINT_DIR, exist_ok=True)
+    os.makedirs(config.PLOTS_DIR, exist_ok=True)
+
+    # Get data loaders
+    print("Loading CIFAR-10 dataset...")
+    train_loader, test_loader = get_data_loaders()
+    print(f"Training samples: {len(train_loader.dataset)}")
+    print(f"Test samples: {len(test_loader.dataset)}")
+
+    # Create model
+    print(f"\nCreating model on device: {config.DEVICE}")
+    model = get_model(num_classes=config.NUM_CLASSES, device=config.DEVICE)
+    print(f"Model parameters: {count_parameters(model):,}")
+
+    # Loss function and optimizer
+    criterion = nn.CrossEntropyLoss()
+    optimizer = optim.SGD(
+        model.parameters(),
+        lr=config.LEARNING_RATE,
+        momentum=config.MOMENTUM,
+        weight_decay=config.WEIGHT_DECAY
+    )
+
+    # Learning rate scheduler
+    scheduler = None
+    if config.USE_SCHEDULER:
+        scheduler = optim.lr_scheduler.StepLR(
+            optimizer,
+            step_size=config.SCHEDULER_STEP_SIZE,
+            gamma=config.SCHEDULER_GAMMA
+        )
+
+    # Training history
+    history = {
+        'train_loss': [],
+        'train_acc': [],
+        'val_loss': [],
+        'val_acc': []
+    }
+
+    best_acc = 0.0
+    start_epoch = 0
+
+    # Training loop
+    print(f"\nStarting training for {config.EPOCHS} epochs...")
+    for epoch in range(start_epoch, config.EPOCHS):
+        print(f"\nEpoch {epoch + 1}/{config.EPOCHS}")
+        print("-" * 50)
+
+        # Train
+        train_loss, train_acc = train_epoch(
+            model, train_loader, criterion, optimizer, config.DEVICE
+        )
+
+        # Validate
+        val_loss, val_acc = validate(
+            model, test_loader, criterion, config.DEVICE
+        )
+
+        # Update learning rate
+        if scheduler:
+            scheduler.step()
+            current_lr = scheduler.get_last_lr()[0]
+            print(f"Learning rate: {current_lr:.6f}")
+
+        # Save history
+        history['train_loss'].append(train_loss)
+        history['train_acc'].append(train_acc)
+        history['val_loss'].append(val_loss)
+        history['val_acc'].append(val_acc)
+
+        # Print epoch summary
+        print(f"\nEpoch {epoch + 1} Summary:")
+        print(f"Train Loss: {train_loss:.4f} | Train Acc: {train_acc:.2f}%")
+        print(f"Val Loss: {val_loss:.4f} | Val Acc: {val_acc:.2f}%")
+
+        # Save best model
+        if val_acc > best_acc:
+            best_acc = val_acc
+            save_checkpoint(
+                model, optimizer, epoch, val_acc,
+                config.BEST_MODEL_PATH
+            )
+            print(f"✓ Best model saved with accuracy: {best_acc:.2f}%")
+
+        # Save last model
+        save_checkpoint(
+            model, optimizer, epoch, val_acc,
+            config.LAST_MODEL_PATH
+        )
+
+    # Plot training history
+    plot_training_history(history, config.PLOTS_DIR)
+
+    print("\n" + "=" * 50)
+    print(f"Training completed!")
+    print(f"Best validation accuracy: {best_acc:.2f}%")
+    print("=" * 50)
+
+
+if __name__ == '__main__':
+    train()
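The accuracy bookkeeping in `train_epoch`/`validate` above reduces to a running correct/total ratio accumulated batch by batch. A dependency-free sketch of that arithmetic (the label lists are made-up data, not CIFAR-10 output):

```python
# Minimal sketch of the running-accuracy bookkeeping from train.py,
# without torch: compare predicted vs. true labels batch by batch.
def running_accuracy(batches):
    """batches: iterable of (predicted_labels, true_labels) pairs of lists."""
    correct = 0
    total = 0
    for preds, labels in batches:
        total += len(labels)                              # labels.size(0)
        correct += sum(p == t for p, t in zip(preds, labels))  # predicted.eq(labels).sum()
    return 100.0 * correct / total

# Made-up example batches: 3 of 5 predictions correct overall.
acc = running_accuracy([([1, 2, 3], [1, 2, 0]), ([0, 0], [0, 1])])
print(f"{acc:.2f}%")  # 60.00%
```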
train_log.txt ADDED
The diff for this file is too large to render. See raw diff
 
utils.py ADDED
@@ -0,0 +1,186 @@
+"""
+Utility functions for the CIFAR-10 CNN project
+"""
+import os
+import torch
+import matplotlib
+matplotlib.use('Agg')
+import matplotlib.pyplot as plt
+import numpy as np
+from sklearn.metrics import confusion_matrix, classification_report
+import seaborn as sns
+
+import config
+
+
+def save_checkpoint(model, optimizer, epoch, accuracy, filepath):
+    """
+    Save model checkpoint
+
+    Args:
+        model: PyTorch model
+        optimizer: Optimizer
+        epoch: Current epoch
+        accuracy: Current accuracy
+        filepath: Path to save checkpoint
+    """
+    checkpoint = {
+        'epoch': epoch,
+        'model_state_dict': model.state_dict(),
+        'optimizer_state_dict': optimizer.state_dict(),
+        'accuracy': accuracy
+    }
+    torch.save(checkpoint, filepath)
+
+
+def load_checkpoint(model, optimizer, filepath):
+    """
+    Load model checkpoint
+
+    Args:
+        model: PyTorch model
+        optimizer: Optimizer
+        filepath: Path to checkpoint file
+
+    Returns:
+        tuple: (epoch, accuracy)
+    """
+    checkpoint = torch.load(filepath, map_location=config.DEVICE)
+    model.load_state_dict(checkpoint['model_state_dict'])
+    if optimizer:
+        optimizer.load_state_dict(checkpoint['optimizer_state_dict'])
+    epoch = checkpoint['epoch']
+    accuracy = checkpoint['accuracy']
+    return epoch, accuracy
+
+
+def plot_training_history(history, save_dir):
+    """
+    Plot training history
+
+    Args:
+        history: Dictionary containing training history
+        save_dir: Directory to save plots
+    """
+    fig, (ax1, ax2) = plt.subplots(1, 2, figsize=(15, 5))
+
+    # Plot loss
+    ax1.plot(history['train_loss'], label='Train Loss', linewidth=2)
+    ax1.plot(history['val_loss'], label='Validation Loss', linewidth=2)
+    ax1.set_xlabel('Epoch', fontsize=12)
+    ax1.set_ylabel('Loss', fontsize=12)
+    ax1.set_title('Training and Validation Loss', fontsize=14, fontweight='bold')
+    ax1.legend(fontsize=10)
+    ax1.grid(True, alpha=0.3)
+
+    # Plot accuracy
+    ax2.plot(history['train_acc'], label='Train Accuracy', linewidth=2)
+    ax2.plot(history['val_acc'], label='Validation Accuracy', linewidth=2)
+    ax2.set_xlabel('Epoch', fontsize=12)
+    ax2.set_ylabel('Accuracy (%)', fontsize=12)
+    ax2.set_title('Training and Validation Accuracy', fontsize=14, fontweight='bold')
+    ax2.legend(fontsize=10)
+    ax2.grid(True, alpha=0.3)
+
+    plt.tight_layout()
+    plt.savefig(os.path.join(save_dir, 'training_history.png'), dpi=300, bbox_inches='tight')
+    plt.close()
+
+
+def plot_confusion_matrix(y_true, y_pred, save_path):
+    """
+    Plot confusion matrix
+
+    Args:
+        y_true: True labels
+        y_pred: Predicted labels
+        save_path: Path to save the plot
+    """
+    cm = confusion_matrix(y_true, y_pred)
+
+    plt.figure(figsize=(12, 10))
+    sns.heatmap(
+        cm, annot=True, fmt='d', cmap='Blues',
+        xticklabels=config.CLASS_NAMES,
+        yticklabels=config.CLASS_NAMES,
+        cbar_kws={'label': 'Count'}
+    )
+    plt.xlabel('Predicted Label', fontsize=12)
+    plt.ylabel('True Label', fontsize=12)
+    plt.title('Confusion Matrix', fontsize=14, fontweight='bold')
+    plt.tight_layout()
+    plt.savefig(save_path, dpi=300, bbox_inches='tight')
+    plt.close()
+
+
+def print_classification_report(y_true, y_pred):
+    """
+    Print classification report
+
+    Args:
+        y_true: True labels
+        y_pred: Predicted labels
+    """
+    report = classification_report(
+        y_true, y_pred,
+        target_names=config.CLASS_NAMES,
+        digits=4
+    )
+    print("\nClassification Report:")
+    print("=" * 80)
+    print(report)
+    print("=" * 80)
+
+
+def visualize_predictions(model, test_loader, device, num_images=16):
+    """
+    Visualize model predictions
+
+    Args:
+        model: PyTorch model
+        test_loader: Test data loader
+        device: Device to run on
+        num_images: Number of images to visualize
+    """
+    model.eval()
+
+    # Get a batch of images
+    images, labels = next(iter(test_loader))
+    images, labels = images[:num_images], labels[:num_images]
+    images_device = images.to(device)
+
+    # Get predictions
+    with torch.no_grad():
+        outputs = model(images_device)
+        _, predicted = outputs.max(1)
+
+    # Plot
+    fig, axes = plt.subplots(4, 4, figsize=(12, 12))
+    axes = axes.ravel()
+
+    for idx in range(num_images):
+        # Denormalize image
+        img = images[idx].cpu().numpy().transpose(1, 2, 0)
+        mean = np.array([0.4914, 0.4822, 0.4465])
+        std = np.array([0.2470, 0.2435, 0.2616])
+        img = img * std + mean
+        img = np.clip(img, 0, 1)
+
+        # Plot
+        axes[idx].imshow(img)
+        axes[idx].axis('off')
+
+        true_label = config.CLASS_NAMES[labels[idx]]
+        pred_label = config.CLASS_NAMES[predicted[idx].cpu()]
+
+        color = 'green' if labels[idx] == predicted[idx].cpu() else 'red'
+        axes[idx].set_title(
+            f'True: {true_label}\nPred: {pred_label}',
+            color=color, fontsize=10
+        )
+
+    plt.tight_layout()
+    plt.savefig(os.path.join(config.PLOTS_DIR, 'predictions.png'), dpi=300, bbox_inches='tight')
+    plt.close()
+
+    print(f"Predictions visualization saved to {config.PLOTS_DIR}/predictions.png")
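`visualize_predictions` above undoes the dataset normalization with `img * std + mean` followed by `np.clip(img, 0, 1)`. The same per-channel arithmetic on plain floats, using the channel statistics from utils.py (the sample pixel values are made up):

```python
# Sketch of the per-channel denormalization step from visualize_predictions,
# written in plain Python instead of numpy.
MEAN = [0.4914, 0.4822, 0.4465]  # CIFAR-10 channel means (as in utils.py)
STD = [0.2470, 0.2435, 0.2616]   # CIFAR-10 channel stds

def denormalize_pixel(pixel):
    """pixel: [r, g, b] in normalized space -> values clipped to [0, 1]."""
    out = []
    for value, mean, std in zip(pixel, MEAN, STD):
        v = value * std + mean
        out.append(min(max(v, 0.0), 1.0))  # equivalent of np.clip(v, 0, 1)
    return out

print(denormalize_pixel([0.0, 0.0, 0.0]))  # a zero pixel recovers the channel means
```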