Update README.md
README.md
CHANGED
@@ -6,4 +6,8 @@ This model can encode 224x224 RGB image into 28x28x13bit (1274 bytes) latent. Th

 12M params for Encoder + Decoder. Trained on LAION-Aesthetics V2 5+ for 60M images.

+Guided by https://huggingface.co/laion/CLIP-ViT-B-32-laion2B-s34B-b79K (it's great. better than OpenAI CLIP B/32) and https://github.com/dingkeyan93/DISTS.
+
+No GAN loss. So probably the image is slightly blurred in some cases?
+
 (still training. final checkpt will be better)
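
The hunk header quotes the README's latent size: a 224x224 RGB image encoded into a 28x28 grid at 13 bits per position, i.e. 1274 bytes. A minimal Python sketch of that arithmetic is below. The function name `latent_bytes` is hypothetical, and the raw image size assumes 8 bits per RGB channel, which the README does not state; this is a sanity check of the numbers, not part of the model's code.

```python
def latent_bytes(height: int, width: int, bits_per_cell: int) -> int:
    """Bytes needed to store a height x width grid of fixed-width codes."""
    total_bits = height * width * bits_per_cell
    return (total_bits + 7) // 8  # round up to whole bytes


latent = latent_bytes(28, 28, 13)  # 28 * 28 * 13 = 10192 bits
raw = 224 * 224 * 3                # assumption: 8 bits per RGB channel

print(latent)  # 1274
print(raw)     # 150528
print(f"compression ratio: about {raw / latent:.1f}x")
```

Under that 8-bit assumption the latent is roughly two orders of magnitude smaller than the uncompressed image.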