Reasons for two-stage training: why train first at a resolution of 384 instead of 1024?

#15

by zijun - opened Aug 14, 2023

Aug 14, 2023

It seems that the training has two stages:
stage 1: 20,000 steps with resolution of 384
stage 2: 20,000 steps with resolution of 1024
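
For concreteness, the schedule above can be written as a step-to-resolution lookup. This is only a minimal sketch of the two stages as listed here; `STAGES` and `resolution_for_step` are hypothetical names for illustration and are not taken from the actual training script.

```python
# Hypothetical sketch of the two-stage schedule listed above.
# The names here are illustrative, not from the real training code.
STAGES = [
    (20_000, 384),   # stage 1: 20,000 steps at resolution 384
    (20_000, 1024),  # stage 2: 20,000 steps at resolution 1024
]


def resolution_for_step(step: int) -> int:
    """Return the training resolution for a 0-indexed global step."""
    if step < 0:
        raise ValueError("step must be non-negative")
    boundary = 0
    for num_steps, resolution in STAGES:
        boundary += num_steps  # first step index past this stage
        if step < boundary:
            return resolution
    raise ValueError("step is beyond the end of the schedule")
```

The single-stage alternative asked about below would correspond to `STAGES = [(40_000, 1024)]`.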

What is the reason for training at 384 resolution in stage 1? Why not just train at 1024 resolution for 40,000 steps?
Is there any ablation study or experiment report showing that two-stage training is necessary?

williamberman

Aug 28, 2023

No super rigorous reasons for doing the two-stage training. We just found that training at 1024 helped with sample quality. IIRC, training on just 1024 was also sufficient.

williamberman changed discussion status to closed Aug 28, 2023
