Update README.md

2fb19e5 verified 5 days ago

1.67 kB

license: mit
tags:
  - gan
  - pytorch
  - vision
  - dcgan
  - faces
  - humans
  - face
metrics:
  - loss
datasets:
  - SDbiaseval/faces

FaceGen v1 - 128px DCGAN

This model is a Deep Convolutional Generative Adversarial Network (DCGAN) trained to generate high-quality 128x128 images of human faces. It was trained for 250 epochs on a curated dataset of feline images, pushing the boundaries of traditional GAN architectures at this resolution.

Sample

Here's a sample after epoch 200:

Model Details

Architecture: DCGAN (Deep Convolutional GAN)
Resolution: 128x128 pixels (RGB)
Parameters: ~186M (Generator)
Training Duration: ~22 hours on NVIDIA RTX 5060 Ti 16GB
Framework: PyTorch with Mixed Precision (AMP)

Training Hyperparameters

Batch Size: 128
Learning Rate: 0.0002
Optimizer: Adam (Beta1: 0.5, Beta2: 0.999)
Latent Vector (Z): 128 dimensions

Training details

The full training code can be found as train.py in this repo. The training data we used was from HF: stable-bias/faces

Intended Use

This model is intended for artistic and research purposes. It demonstrates how GANs can capture complex faces and even eye reflections at medium resolutions.

How to use

To use this model, clone this repository and run the provided inference script. Ensure you have matplotlib, torch and torchvision installed.

python3 inference.py

Sample output: