ZeroCool94 commited on
Commit
c1f91e2
·
1 Parent(s): af05ebe

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +5 -9
README.md CHANGED
@@ -41,7 +41,7 @@ This model is still in its infancy and it's meant to be constantly updated and t
41
  - [vae.sygil_muse_v0.1.pt](https://huggingface.co/Sygil/Sygil-Muse/blob/main/vae.sygil_muse_v0.1.pt): Trained from scratch for 3.0M steps with **dim: 128** and **vq_codebook_size: 256**.
42
  - [maskgit.sygil_muse_v0.1.pt](https://huggingface.co/Sygil/Sygil-Muse/blob/main/maskgit.sygil_muse_v0.1.pt): Maskgit trained from the VAE for 3.46M steps
43
  - #### Beta:
44
- - [vae.932300.pt](https://huggingface.co/Sygil/Sygil-Muse/blob/main/vae.932300.pt): Trained from scratch for 932K steps and higher **vq_codebook_size** than before.
45
  - [maskgit.10000.pt](https://huggingface.co/Sygil/Sygil-Muse/blob/main/maskgit.10000.pt): Maskgit trained from the VAE for 10K steps
46
 
47
  Note: Checkpoints under the Beta section are updated daily or at least 3-4 times a week. While the beta checkpoints can be used as they are only the latest version is kept on the repo and the older checkpoints are removed when a new one
@@ -56,25 +56,21 @@ The model was trained on the following dataset:
56
  **Hardware and others**
57
  - **Hardware:** 1 x Nvidia RTX 3050 GPU
58
  - **Hours Trained:** NaN.
59
- - **Gradient Accumulations**: 1
60
  - **Batch:** 1
61
- - **Learning Rate:** 1e-04
62
- - **Learning Rate Scheduler:** `cosine_with_restarts`
63
  - **Optimizer:** Adam
64
- - **Weight Decay:** 1e-4
65
  - **Warmup Steps:** 10,000
66
  - **Number of Cycles:** 100
67
  - **Resolution/Image Size**: First trained at a resolution of 64x64, then increased to 256x256 and then to 512x512. Check the notes down below for more details on this.
68
  - **Dimension:** 128
69
  - **vq_codebook_size:** 8192
70
- - **Total Training Steps:** 932,300
71
 
72
  Note: On Muse we can change the image_size or resolution at any time without having to train the model from scratch again, this allows us to first train the model at low resolution using the same `dim` and `vq_codebook_size` to train faster and then we can increase the `image_size` and use a higher resolution once the model has trained enough.
73
 
74
  Developed by: [ZeroCool](https://github.com/ZeroCool940711) at [Sygil-Dev](https://github.com/Sygil-Dev/)
75
 
76
- ## Community Contributions:
77
- - [Chad Kensington (isamu isozaki)](https://github.com/isamu-isozaki/muse-maskgit-pytorch): Thanks for helping with the training scripts and improving the code for Muse.
78
-
79
  # License
80
  This model is open access and available to all, with a CreativeML Open RAIL++-M License further specifying rights and usage.
 
41
  - [vae.sygil_muse_v0.1.pt](https://huggingface.co/Sygil/Sygil-Muse/blob/main/vae.sygil_muse_v0.1.pt): Trained from scratch for 3.0M steps with **dim: 128** and **vq_codebook_size: 256**.
42
  - [maskgit.sygil_muse_v0.1.pt](https://huggingface.co/Sygil/Sygil-Muse/blob/main/maskgit.sygil_muse_v0.1.pt): Maskgit trained from the VAE for 3.46M steps
43
  - #### Beta:
44
+ - [vae.1245400.pt](https://huggingface.co/Sygil/Sygil-Muse/blob/main/vae.1245400.pt): Trained from scratch for 1.24M steps and higher **vq_codebook_size** than before.
45
  - [maskgit.10000.pt](https://huggingface.co/Sygil/Sygil-Muse/blob/main/maskgit.10000.pt): Maskgit trained from the VAE for 10K steps
46
 
47
  Note: Checkpoints under the Beta section are updated daily or at least 3-4 times a week. While the beta checkpoints can be used as they are only the latest version is kept on the repo and the older checkpoints are removed when a new one
 
56
  **Hardware and others**
57
  - **Hardware:** 1 x Nvidia RTX 3050 GPU
58
  - **Hours Trained:** NaN.
59
+ - **Gradient Accumulations**: 5
60
  - **Batch:** 1
61
+ - **Learning Rate:** 1e-4
62
+ - **Learning Rate Scheduler:** `constant_with_warmup`
63
  - **Optimizer:** Adam
 
64
  - **Warmup Steps:** 10,000
65
  - **Number of Cycles:** 100
66
  - **Resolution/Image Size**: First trained at a resolution of 64x64, then increased to 256x256 and then to 512x512. Check the notes down below for more details on this.
67
  - **Dimension:** 128
68
  - **vq_codebook_size:** 8192
69
+ - **Total Training Steps:** 1,245,400
70
 
71
  Note: On Muse we can change the image_size or resolution at any time without having to train the model from scratch again, this allows us to first train the model at low resolution using the same `dim` and `vq_codebook_size` to train faster and then we can increase the `image_size` and use a higher resolution once the model has trained enough.
72
 
73
  Developed by: [ZeroCool](https://github.com/ZeroCool940711) at [Sygil-Dev](https://github.com/Sygil-Dev/)
74
 
 
 
 
75
  # License
76
  This model is open access and available to all, with a CreativeML Open RAIL++-M License further specifying rights and usage.