Update README.md
Browse files
README.md
CHANGED
|
@@ -1,3 +1,47 @@
|
|
| 1 |
-
---
|
| 2 |
-
license: mit
|
| 3 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
---
|
| 2 |
+
license: mit
|
| 3 |
+
tags:
|
| 4 |
+
- gan
|
| 5 |
+
- pytorch
|
| 6 |
+
- vision
|
| 7 |
+
- cats
|
| 8 |
+
- dcgan
|
| 9 |
+
metrics:
|
| 10 |
+
- loss
|
| 11 |
+
datasets:
|
| 12 |
+
- huggan/cats
|
| 13 |
+
---
|
| 14 |
+
|
| 15 |
+
# CatGen v2 - 128px DCGAN
|
| 16 |
+
|
| 17 |
+
This model is a Deep Convolutional Generative Adversarial Network (DCGAN) trained to generate high-quality 128x128 images of cats. It was trained for 165 epochs on a curated dataset of feline images, pushing the boundaries of traditional GAN architectures at this resolution.
|
| 18 |
+
|
| 19 |
+
## Sample
|
| 20 |
+
Here's a sample after epoch 165:
|
| 21 |
+

|
| 22 |
+
|
| 23 |
+
## Model Details
|
| 24 |
+
- **Architecture:** DCGAN (Deep Convolutional GAN)
|
| 25 |
+
- **Resolution:** 128x128 pixels (RGB)
|
| 26 |
+
- **Parameters:** ~186M (Generator)
|
| 27 |
+
- **Training Duration:** ~5 hours on NVIDIA T4 GPU
|
| 28 |
+
- **Framework:** PyTorch with Mixed Precision (AMP)
|
| 29 |
+
|
| 30 |
+
## Training Hyperparameters
|
| 31 |
+
- **Batch Size:** 128
|
| 32 |
+
- **Learning Rate:** 0.0002
|
| 33 |
+
- **Optimizer:** Adam (Beta1: 0.5, Beta2: 0.999)
|
| 34 |
+
- **Latent Vector (Z):** 128 dimensions
|
| 35 |
+
|
| 36 |
+
## Intended Use
|
| 37 |
+
This model is intended for artistic and research purposes. It demonstrates how GANs can capture complex textures like fur and eye reflections at medium resolutions.
|
| 38 |
+
|
| 39 |
+
## How to use
|
| 40 |
+
To use this model, clone this repository and run the provided inference script. Ensure you have `matplotlib`, `torch` and `torchvision` installed.
|
| 41 |
+
|
| 42 |
+
```bash
|
| 43 |
+
python3 inference.py
|
| 44 |
+
```
|
| 45 |
+
|
| 46 |
+
## Limitations & Bias
|
| 47 |
+
As a GAN, the model might occasionally produce "dream-like" artifacts or distorted anatomy (e.g., extra ears or eyes). It is not a diffusion model and generates images in a single forward pass.
|