LH-Tech-AI commited on
Commit
bfdf9fc
·
verified ·
1 Parent(s): c5d3855

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +47 -3
README.md CHANGED
@@ -1,3 +1,47 @@
1
- ---
2
- license: mit
3
- ---
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: mit
3
+ tags:
4
+ - gan
5
+ - pytorch
6
+ - vision
7
+ - cats
8
+ - dcgan
9
+ metrics:
10
+ - loss
11
+ datasets:
12
+ - huggan/cats
13
+ ---
14
+
15
+ # CatGen v2 - 128px DCGAN
16
+
17
+ This model is a Deep Convolutional Generative Adversarial Network (DCGAN) trained to generate high-quality 128x128 images of cats. It was trained for 165 epochs on a curated dataset of feline images, pushing the boundaries of traditional GAN architectures at this resolution.
18
+
19
+ ## Sample
20
+ Here's a sample after epoch 165:
21
+ ![__results___8_0](https://cdn-uploads.huggingface.co/production/uploads/697f2832c2c5e4daa93cece7/VV8AhZgJFA_dvsV1-ul7P.png)
22
+
23
+ ## Model Details
24
+ - **Architecture:** DCGAN (Deep Convolutional GAN)
25
+ - **Resolution:** 128x128 pixels (RGB)
26
+ - **Parameters:** ~186M (Generator)
27
+ - **Training Duration:** ~5 hours on NVIDIA T4 GPU
28
+ - **Framework:** PyTorch with Mixed Precision (AMP)
29
+
30
+ ## Training Hyperparameters
31
+ - **Batch Size:** 128
32
+ - **Learning Rate:** 0.0002
33
+ - **Optimizer:** Adam (Beta1: 0.5, Beta2: 0.999)
34
+ - **Latent Vector (Z):** 128 dimensions
35
+
36
+ ## Intended Use
37
+ This model is intended for artistic and research purposes. It demonstrates how GANs can capture complex textures like fur and eye reflections at medium resolutions.
38
+
39
+ ## How to use
40
+ To use this model, clone this repository and run the provided inference script. Ensure you have `matplotlib`, `torch` and `torchvision` installed.
41
+
42
+ ```bash
43
+ python3 inference.py
44
+ ```
45
+
46
+ ## Limitations & Bias
47
+ As a GAN, the model might occasionally produce "dream-like" artifacts or distorted anatomy (e.g., extra ears or eyes). It is not a diffusion model and generates images in a single forward pass.