CNN / README.md

Upload folder using huggingface_hub

233caeb verified 11 days ago

4.65 kB

	---
	license: mit
	datasets:
	- cifar10
	metrics:
	- accuracy
	library_name: pytorch
	tags:
	- image-classification
	- sequence-classification
	---

	# CIFAR-10 RNN Image Classifier

	An end-to-end deep learning project for classifying CIFAR-10 images using a Recurrent Neural Network (LSTM) built with PyTorch. Includes a modern web interface for real-time image classification.

	## 🌟 Features

	- Custom RNN Architecture: Bidirectional LSTM layers with dropout
	- Complete Training Pipeline: Automated training with validation, checkpointing, and visualization
	- Comprehensive Evaluation: Confusion matrix, classification reports, and prediction visualizations
	- Modern Web Interface: Beautiful Flask web app for real-time image classification
	- CIFAR-10 Dataset: Automatically downloads and preprocesses the dataset

	## 📊 Model Architecture

	The model treats each 32x32 RGB image as a sequence of 32 rows, where each row has 96 features (32 pixels * 3 channels).

	```
	Input (Batch, 3, 32, 32)
	↓
	Reshape (Batch, 32, 96)
	↓
	Bidirectional LSTM (Hidden: 256, Layers: 2, Dropout: 0.2)
	↓
	Last Time Step Output
	↓
	Fully Connected (512) → ReLU → Dropout(0.3)
	↓
	Output (10 classes)
	```

	## 🚀 Quick Start

	### 1. Install Dependencies

	```bash
	pip install -r requirements.txt
	```

	### 2. Train the Model

	```bash
	python train.py
	```

	This will:
	- Download the CIFAR-10 dataset automatically
	- Train the model for 50 epochs
	- Save checkpoints in `./checkpoints/`
	- Generate training plots in `./plots/`

	### 3. Evaluate the Model

	```bash
	python evaluate.py
	```

	This will:
	- Load the best model checkpoint
	- Evaluate on the test set
	- Generate confusion matrix
	- Create prediction visualizations

	### 4. Run the Web Application

	```bash
	python app.py
	```

	Then open your browser and navigate to `http://localhost:5000`

	## 📁 Project Structure

	```
	CNN/
	├── config.py # Configuration and hyperparameters
	├── data_loader.py # Data loading and preprocessing
	├── model.py # CNN model architecture
	├── train.py # Training script
	├── evaluate.py # Evaluation script
	├── utils.py # Utility functions
	├── app.py # Flask web application
	├── requirements.txt # Python dependencies
	├── templates/
	│ └── index.html # Web interface HTML
	├── static/
	│ ├── style.css # Web interface CSS
	│ └── script.js # Web interface JavaScript
	├── checkpoints/ # Model checkpoints (created during training)
	├── plots/ # Training visualizations (created during training)
	└── data/ # CIFAR-10 dataset (downloaded automatically)
	```

	## 🎯 CIFAR-10 Classes

	The model classifies images into 10 categories:
	1. Airplane
	2. Automobile
	3. Bird
	4. Cat
	5. Deer
	6. Dog
	7. Frog
	8. Horse
	9. Ship
	10. Truck

	## ⚙️ Configuration

	Edit `config.py` to customize:
	- Training: epochs, batch size, learning rate
	- Model: number of classes, architecture parameters
	- Data: augmentation settings, normalization values
	- Paths: checkpoint and plot directories

	## 📈 Training Details

	- Optimizer: SGD with momentum (0.9) and weight decay (5e-4)
	- Loss Function: Cross-Entropy Loss
	- Learning Rate: 0.001 with step decay
	- Batch Size: 128
	- Data Augmentation: Random crop and horizontal flip
	- Regularization: Batch normalization and dropout

	## 🎨 Web Interface Features

	- Drag & Drop: Upload images via drag-and-drop
	- Random Samples: Test with random CIFAR-10 images
	- Real-time Classification: Instant predictions with confidence scores
	- Top-5 Predictions: View probability distribution
	- Modern UI: Dark theme with smooth animations

	## 📊 Expected Performance

	With the default configuration, the model typically achieves:
	- Training Accuracy: ~90%
	- Validation Accuracy: ~85%

	## 🛠️ Requirements

	- Python 3.7+
	- PyTorch 2.0+
	- torchvision
	- Flask
	- NumPy
	- Matplotlib
	- scikit-learn
	- Pillow
	- tqdm

	## 📝 License

	This project is open source and available for educational purposes.

	## 🤝 Contributing

	Feel free to fork this project and submit pull requests for improvements!

	## 📧 Contact

	For questions or feedback, please open an issue on the repository.

	---

	Built with ❤️ using PyTorch and Flask