Create README.md
Browse files
README.md
ADDED
|
@@ -0,0 +1,31 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
---
|
| 2 |
+
license: cc-by-sa-3.0
|
| 3 |
+
datasets:
|
| 4 |
+
- mnist
|
| 5 |
+
---
|
| 6 |
+
|
| 7 |
+
[WGAN-GP](https://arxiv.org/abs/1704.00028) model trained on the [MNIST dataset](https://www.tensorflow.org/datasets/catalog/mnist) using [JAX in Colab](https://colab.research.google.com/drive/1RzQfrc4Xf_pvGJD2PaNJyaURLh0nO4Fp?usp=sharing).
|
| 8 |
+
|
| 9 |
+
| Real Images | Generated Images |
|
| 10 |
+
| ------- | -------- |
|
| 11 |
+
|  |  |
|
| 12 |
+
|
| 13 |
+
# Training Progression
|
| 14 |
+
<video width="50%" controls>
|
| 15 |
+
<source src="https://cdn-uploads.huggingface.co/production/uploads/649f9483d76ca0fe679011c2/EB8tZYvT3t7GXuO7XgbYq.mp4" type="video/mp4">
|
| 16 |
+
</video>
|
| 17 |
+
|
| 18 |
+
# Details
|
| 19 |
+
This model is based on [WGAN-GP](https://arxiv.org/abs/1704.00028).
|
| 20 |
+
|
| 21 |
+
The model was trained for ~9h40m on a GCE VM instance (n1-standard-4, 1 x NVIDIA T4).
|
| 22 |
+
|
| 23 |
+
The Critic consists of 4 Convolutional Layers with strides for downsampling, and Leaky ReLU activation. The critic does not use Batch Normalization or Dropout.
|
| 24 |
+
|
| 25 |
+
The Generator consists of 4 Transposed Convolutional Layers with ReLU activation and Batch Normalization.
|
| 26 |
+
|
| 27 |
+
The learning rate was kept constant at 1e-4 for the first 50,000 steps, which was followed by cosine annealing cycles with a peak LR of 1e-3.
|
| 28 |
+
|
| 29 |
+
The Lambda (gradient penalty coefficient) used was 10 (same as the original paper).
|
| 30 |
+
|
| 31 |
+
For more details, please refer to the [Colab Notebook](https://colab.research.google.com/drive/1RzQfrc4Xf_pvGJD2PaNJyaURLh0nO4Fp?usp=sharing).
|