# GAN Implementation - MNIST Digit Generation

A comprehensive implementation of Generative Adversarial Networks (GANs) for generating MNIST handwritten digits using PyTorch.
## Features

- **Complete GAN Implementation**: Both standard and optimized versions
- **MNIST Digit Generation**: Generate realistic handwritten digits
- **Multiple Training Modes**: Standard and lite modes for different performance needs
- **Comprehensive Logging**: Detailed training logs and progress tracking
- **GPU Support**: MPS (Apple Silicon), CUDA, and CPU support
- **Visualization**: Real-time training progress and generated samples
## Results

The implementation generates realistic MNIST digits with:

- **Generator Parameters**: 576K (lite) / 3.5M (standard)
- **Discriminator Parameters**: 533K (lite) / 2.7M (standard)
- **Training Time**: ~5 minutes (lite mode) / ~30 minutes (standard)
## Quick Start

### Installation

```bash
# Clone the repository
git clone https://github.com/GruheshKurra/GAN_Implementation.git
cd GAN_Implementation

# Install dependencies
pip install -r requirements.txt
```
### Usage

1. **Open the Jupyter notebook**:
   ```bash
   jupyter notebook Gan.ipynb
   ```
2. **Run the cells** to train the GAN and generate digits.
3. **Choose your mode**:
   - **Standard Mode**: Full implementation with detailed logging
   - **Lite Mode**: Optimized for faster training and lower resource usage
## Project Structure

```
GAN_Implementation/
├── Gan.ipynb                                   # Main implementation notebook
├── requirements.txt                            # Python dependencies
├── README.md                                   # This file
├── Generative Adversarial Networks (GANs).md   # Theory and documentation
├── gan_training.log                            # Training logs (standard mode)
├── gan_training_lite.log                       # Training logs (lite mode)
├── generator_lite.pth                          # Saved model weights
└── data/                                       # MNIST dataset
    └── MNIST/
        └── raw/                                # Raw MNIST data files
```
## Implementation Details

### Architecture

**Generator Network**:
- Input: Random noise vector (100-D standard / 64-D lite)
- Hidden layers: Linear → BatchNorm → ReLU
- Output: 784-D vector (a 28×28 MNIST image)
- Output activation: Tanh (range [-1, 1])

**Discriminator Network**:
- Input: 784-D image vector
- Hidden layers: Linear → LeakyReLU → Dropout
- Output: Single probability (real vs. fake)
- Output activation: Sigmoid
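The layer pattern above can be sketched in PyTorch roughly as follows. The hidden-layer widths (256/512) are illustrative assumptions; the notebook's exact sizes may differ.

```python
import torch
import torch.nn as nn

LATENT_DIM = 100   # 64 in lite mode
IMG_DIM = 28 * 28  # 784-D flattened MNIST image


class Generator(nn.Module):
    """Noise vector -> flattened image in [-1, 1] (illustrative widths)."""

    def __init__(self, latent_dim: int = LATENT_DIM):
        super().__init__()
        self.net = nn.Sequential(
            nn.Linear(latent_dim, 256),
            nn.BatchNorm1d(256),
            nn.ReLU(inplace=True),
            nn.Linear(256, 512),
            nn.BatchNorm1d(512),
            nn.ReLU(inplace=True),
            nn.Linear(512, IMG_DIM),
            nn.Tanh(),  # matches data normalized to [-1, 1]
        )

    def forward(self, z):
        return self.net(z)


class Discriminator(nn.Module):
    """Flattened image -> probability that the input is real."""

    def __init__(self):
        super().__init__()
        self.net = nn.Sequential(
            nn.Linear(IMG_DIM, 512),
            nn.LeakyReLU(0.2, inplace=True),
            nn.Dropout(0.3),
            nn.Linear(512, 256),
            nn.LeakyReLU(0.2, inplace=True),
            nn.Dropout(0.3),
            nn.Linear(256, 1),
            nn.Sigmoid(),
        )

    def forward(self, x):
        return self.net(x)
```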
### Training Process

1. **Data Preparation**: MNIST images normalized to [-1, 1]
2. **Adversarial Training**:
   - The discriminator learns to distinguish real from fake images
   - The generator learns to fool the discriminator
3. **Loss Function**: Binary cross-entropy loss
4. **Optimization**: Adam optimizer with β₁ = 0.5, β₂ = 0.999
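One adversarial update with BCE loss might look like this sketch. The function name, structure, and learning rate (`2e-4`, a common default) are illustrative assumptions, not the notebook's exact code; only the beta values come from the section above.

```python
import torch
import torch.nn as nn


def train_step(G, D, real_imgs, opt_G, opt_D, latent_dim, device="cpu"):
    """One GAN update: discriminator step first, then generator step."""
    criterion = nn.BCELoss()
    batch = real_imgs.size(0)
    real_labels = torch.ones(batch, 1, device=device)
    fake_labels = torch.zeros(batch, 1, device=device)

    # Discriminator: push D(real) -> 1 and D(fake) -> 0.
    opt_D.zero_grad()
    d_loss_real = criterion(D(real_imgs), real_labels)
    z = torch.randn(batch, latent_dim, device=device)
    fake_imgs = G(z)
    # detach() so this backward pass does not touch the generator
    d_loss_fake = criterion(D(fake_imgs.detach()), fake_labels)
    d_loss = d_loss_real + d_loss_fake
    d_loss.backward()
    opt_D.step()

    # Generator: push D(fake) -> 1, i.e. fool the discriminator.
    opt_G.zero_grad()
    g_loss = criterion(D(fake_imgs), real_labels)
    g_loss.backward()
    opt_G.step()

    return d_loss.item(), g_loss.item()
```

The optimizers would be created with the betas quoted above, e.g. `torch.optim.Adam(model.parameters(), lr=2e-4, betas=(0.5, 0.999))`.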
## Training Modes

### Standard Mode

- **Latent Dimension**: 100
- **Epochs**: 50-100
- **Batch Size**: 64-128
- **Dataset**: Full MNIST (60K samples)
- **Best for**: High-quality results

### Lite Mode

- **Latent Dimension**: 64
- **Epochs**: 50
- **Batch Size**: 64
- **Dataset**: Subset (10K samples)
- **Best for**: Quick experimentation and testing
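The two modes boil down to a handful of hyperparameters. A hypothetical preset table (the names `CONFIGS` and `get_config` are illustrative, not the notebook's actual variables):

```python
# Illustrative mode presets mirroring the tables above.
CONFIGS = {
    "standard": {
        "latent_dim": 100,
        "epochs": 100,          # 50-100 in practice
        "batch_size": 128,      # 64-128 in practice
        "num_samples": 60_000,  # full MNIST training set
    },
    "lite": {
        "latent_dim": 64,
        "epochs": 50,
        "batch_size": 64,
        "num_samples": 10_000,  # subset for quick runs
    },
}


def get_config(mode: str = "lite") -> dict:
    """Return the hyperparameter preset for a training mode."""
    return CONFIGS[mode]
```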
## Technical Features

- **Device Auto-Detection**: Automatically uses MPS, CUDA, or CPU
- **Memory Optimization**: Efficient memory usage with cache clearing
- **Progress Tracking**: Real-time loss monitoring and sample generation
- **Model Persistence**: Save/load trained models
- **Comprehensive Logging**: Detailed training metrics and timing
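The device auto-detection and model persistence described above can be sketched like this (assuming PyTorch 2.x, where `torch.backends.mps` is available; `G` stands for a trained generator module):

```python
import torch


def pick_device() -> torch.device:
    """Prefer Apple-Silicon MPS, then CUDA, then fall back to CPU."""
    if torch.backends.mps.is_available():
        return torch.device("mps")
    if torch.cuda.is_available():
        return torch.device("cuda")
    return torch.device("cpu")


device = pick_device()

# Model persistence: the checkpoint name matches generator_lite.pth
# from the project tree above.
# torch.save(G.state_dict(), "generator_lite.pth")
# G.load_state_dict(torch.load("generator_lite.pth", map_location=device))
```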
## Performance Metrics

| Mode     | Training Time | Generator Loss | Discriminator Loss | Quality |
|----------|---------------|----------------|--------------------|---------|
| Standard | ~30 min       | ~1.5           | ~0.7               | High    |
| Lite     | ~5 min        | ~2.0           | ~0.6               | Good    |
## Use Cases

- **Education**: Learn GAN fundamentals with working code
- **Research**: Baseline for GAN experiments
- **Prototyping**: Quick testing of GAN modifications
- **Production**: Starting point for a larger digit-generation service
## Links & Resources

- **GitHub Repository**: [https://github.com/GruheshKurra/GAN_Implementation](https://github.com/GruheshKurra/GAN_Implementation)
- **Hugging Face**: [https://huggingface.co/karthik-2905/GAN_Implementation](https://huggingface.co/karthik-2905/GAN_Implementation)
- **Blog Post**: Coming soon on daily.dev
- **Theory Documentation**: See `Generative Adversarial Networks (GANs).md`
## Requirements

- Python 3.7+
- PyTorch 2.0+
- torchvision 0.15+
- matplotlib 3.5+
- numpy 1.21+
- jupyter 1.0+
## License

This project is open source and available under the MIT License.

## Contributing

Contributions are welcome! Please feel free to submit a Pull Request.

## Contact

- **Author**: Karthik
- **GitHub**: [@GruheshKurra](https://github.com/GruheshKurra)

## Acknowledgments

- Original GAN paper by Ian Goodfellow et al.
- PyTorch team for the excellent deep learning framework
- MNIST dataset creators
---

**⭐ If you find this implementation helpful, please give it a star!**